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Classification Is one of the fundamental 
concerns of science. Facts and objects 
must be arranged In an orderly fashion 
before their unifying principals con be 
discovered and used as the basis for pre- 
diction. Many phenomena occur in such 
variety and profusion that unless some 
system Is created among them they would 
be unlikely to provide any useful Infor- 
mation. 



SOKAL, 1966 



^ S‘ 



FOKMAT C iHi IOAB 9 10 X 9 SHSTPATUM 
oGT; 0.0 )"G0~T0“4 



6A2 / ) 



o' 

ERIC 



IF ( T ( I ) 

PHINT 33 

FORMAT ("7/~roXT~SH EMPTY )" 
GO TO 5 
T I=~i 2V1 









r 



DO 41 J=Il9 

IF ( map ( J i“NEVT~)‘”eo~ro~«' 












\ 



PREFACE 



What is the subject matter of this report? 

This report describes an empirical approach to the study 
of organizational and demographic characteristics of school 
districts. The particular approach set forth is the result 
of research work which was initiated as ”A Study of Certain 
Characteristic Patterns of Elementary School Districts in 
Wisconsin" , The initial conceptualization and plan of the 
study were relatively restricted^ but as the work progressed 
and certain methodological problems were solved it became 
clear that the empirical approach and technical developments 
had considerable generality in terms of their applications 
and implications. 

For the purpose of stating clearly the broad applications 
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and implications of the Tesearch methodology) the conceptuali 
zation of the project was reformulated as "Multivariate Pro- 
cedures for Stratifying School Districts" , This conceptuali- 
zation is reflected in the organization of the report) which 
\ consists of three major parts: 



Part One discusses the scientific issues and concep ^s 
concerning school districts as a population 
of organizational entities) and the technical 
and statistical problems involved in studying 
a populatioyi or sample of these entities. 

Part Two presents the statistical methods and techniques 
appropriate for the study of a population of 
school districts) and the data processing pro- 
cedures necessary for manipulation of avail- 
. able information. 



Part Three provides several exemplary applications which 
demonstrate the broad utilii^ of the approach 
and the flexibility of the general procedures 
when applied to specific research problems. 



Of what importance is the research reported herein? 

A first glance at the body of this document will under- 
standably dismay the non-technical reader. The text is laced 
with technical terms such as "descriptive complexity") "repli- 
cdbility" ) and "generalizability". So-called jargon) on first 
sight) often evinces doubt about the meaningfulness and relevance 
of the research from the viewpoint of practicing educational 
administrators. Certainly the non-technical reader cannot be 



expected to digest and assirrri'Late the vcvrious "gymnastics” 
of statistical manipulations such as those discussed in 
Paxt Two, . 

The apparent complexities of the mathematical and 
statistical manipulations should not^ however cloud the 
fundamental issue to which this report is addressed^ for 
these issues have great importance for administrators in 
various types of educational organizations. 

Practicing educational administrators are repeatedly 
confronted with problems involving a large number of educa-~ 
tional collectivities, for example the director of a 
Regional Educational Laboratory is concerned with numerous 
and varied school districts contained within his territory. 

On one hand^ it is impossible for him to attempt to plan an 
activity in which each individual district will be considered 
as a discrete entity with its own individual characteristics , 

On the other hand^ it is foolhardy for him to consider dealing 
with the population of districts as though they were homog- 
enizedi or as if each was highly similar to the others. 
Practicality requires that he find some way of differentiating 
for a variety of purposes^ among districts so that those 
having similar characteristics are grouped together while 
those which are dissimilar are separated. One approach to 
solving such a problem has been to classify districts, according 



to student enToHment size. Such a olassificotion achieves 

a simplification of the situation and the complexity is Teduced. 

Ecwever*^ as most pr*acticing administrators are rnoare^ such cm 

approach tends to oversimplify the situaticm. It fails to 

recognize other important characteristics of the districts^ 

\ 

such as the socio-economic milieu within which they operate^ 
the internal complexities of the agencies ^ and the demographic 
character of the school attendance areas. 

The need for differentiating among school districts is one 
of the basic problems faced by state officials responsible for 
the distribution of state aids to local school districts. The 
key question- posed by this problem is: How can the individuality 

of each distr-ict be recognized in a classification and/or ac- 
creditation scheme which will allow effective and equitable dis- 
tribution of fundsj consistent with the objectives and criteria 
of a state support program? In Wisconsin, for example, a tri- 
chotomous classification of districts has served for a long time 
as the basis for distributing state aids. Historically, use of 
these three classes has achieved the purpose of supporting, 
stimulating, and motivating local districts in the improve- 
ment of their educational programs. Now, however, over 6Q7o 
of Wisconsin districts fall in the same category and are 
treated as if their needs were equal and of the same kind. 

This report does not provide any specific, improved solution 
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to this general problem. It does, hmiever, sharply focus 
attention on the problem of constructing district classifi- 
cation schemes, by describing and demonstrating an approach 
to classification and/or accreditation which takes into account 

the individual characteristics of each district. 

*% 

The crux of the classification and/or accreditation problem 
is that it is not feasible to treat each district as an individual, 
discrete entity by developing idiosyncratic ruUs for say, dis- 
tributing state aid. Nor is it efficient, effective or fair 
to treat all districts as though there were no differences 
among them. District needs vary over the years due.to changes 
in the school- community being served and in the organizational 
character of districts. These comments are not meant to imply 
that the problem of classifying districts has been neglected, 
or that it is an unstudied problem. Concern for differentiating 
among districts is continually evidenced. For example, a 
practicing administrator may say "lou don't launch a new pro- 
gram of in-service teacher training in 'down-state ' districts 
in the same way that you initiate such a program in our largest 
city". But haw can the individuality of districts be em- 
pirically recognized and pragmatically taken into account? 

The purpose of this pre factory disoussion—and indeed of 
the total report— is to emphasize the importance of recognizing 
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that there are many oharaateristics which differentiate school 
districts j and that they should he and can h'' used for practical 
purposes. Furthermore, it is important to establish operationally 
and emp'Lxn^cally the exact nature of the differential relation— 
ships which exist among a set organizational entitites such as 
school districts. In this vein, the basic function of this 
report is to describe a particular set of tools useful in the 
pragmatic consideration of school district individuality. The 
exact procedure: 8 described in this document are not recommended 
as ultimately ideal solutions, hut they do lepresent exeynplary 
solutions to the basic problem. And, they demonstrate that em- 
pirical and systematic approaches can he made to the difficult 
and complex— b,.t important— task of accounting for individual 
differences within a population of school districts. 

The utilities afforded by an appropriate, empirical classi- 
fication of school districts are numerous and varied. Several 
possible uses are discussed in Parts I and II of this report 
and are illustrated in Part III, 

\ 

One of the most noteworthy uses of a stratification 
(classificiation) scheme is in economizing the study of school 
districts, in terms of time, effort and money. For example, 
the need for securing information about an entire state edu- 
cational system does not mean that all districts in the state 



must be surveyed, A stratified random sampling scheme con 
be used for gathering information from a fraction of the 
district prpulationj and only minute losses in the precision 
of summary data will occur. The use of such efficient sampling 
techniques is accompanied by significant reduction in the costs 
\ of information collection. One example of this type of utility 

is given in Part III Section D of this report. 



From what viewpoint and background was this research undertaken? 






The style and tone of this report might understandably 
suggest to the reader that the research was an exercise in the 
manipulation of a grand conceptualization invoking a methodological 
statistical paradigm. The initial formulation of the research 
plans arose a$ a response to a particular problem concerning the 
selection of a sample of elementary school teachers in the state 
of \Jisconsin (see Part Three), Initial discussions for the pur- 
pose of achieving this goal took place in the fall of 1962 be- 
tween staff members of the Wisconsin State Department of Public 
Instruction and a research team of the University of Wisconsin 
I^,struotional Research Laboratory, During the following twelve 
months i much of the preliminary planning and preparatory work 
was accomplished and was culminated by the submission to the 
U, S, Office of Education of the proposal for the project as 



%-t was irnitiated in June^ 1965, This preliminary work involved 
preparation of the specific details of the research plans ^ design 

of the data matrix^ methods of applying the analytic results to 
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eampling problems^ decisions about variables which would form 
the input for the computational analysest and outlines and 
tests of some of the necessary computer programs. 

Thus the approach to investigating school disti^cts de- 
scribed in this report evolved inductively from continuing con- 
sideration of a particular problem until the broad^ general im- 
plications of the approach became apparent to the research team. 
Several comments concerning the history and developments of the 
research work are appropriate for the purposes of indicating the 
favorable conditions under which the project was executed^ and 
of describing some of the logistic difficulties which the 

researchers encountered, 

» 

The planning activities were co-incident with the develop- 
ment of a data processing system in the Wisconsin State Depart- 
ment of Public Instruction (WSDPI), Without this system^ the 
organization^ tabulation ^ and storage of the input data would 
have been a task of such magnitude that it is quite likely that 
the work reported here would never have been undertaken. Data- 
mation has exerted a significant impact on the management of 
education. However^ the technical aspects of such systems are 
frequently not understood by personnel inexperienced in the 
logistics of automatic computing machinery. Some of these 
tecJmical difficulties confronted the researchers for the duration 
of' the project. For example^ the WSDPI data processing system 
used IBM machinery i while the University of Wisconsin Computing 
Center used a CDC 1604 system. These two computer systems are 
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not pTeovseZy Qornpo.tthZe for wrtt'ing and reading data stored 
on Tnagnetvo tapes— -a probZem which can be corrected by improve- 
ments in the manufacture of the eZectronic mechanisms. One 
specific difficuZty of this kind arose when data written with 
IBM equipment had to he read by a CDC mechanism. This re- 

suZted in errors of various kinds during transfer of tape-stored 
data. 

Such difficuZties can be soZved, But the soZutions take 

time and continuous monitoring of computations^ so that mistakes 

can be identified and corrected. It shouZd be noted, however, 

that such difficuZties are reZativeZy minor when the aZtemative 
# 

is considered:, the manual manipuZation of thousands of punched 
cards gathered from severaZ different sources. 



Another pertinent historicaZ comment concerns the anaZytic 
methods and their computationaZ appZications , The procedures 
described inike report invoZve the appZication of compZex muZti- 
vanate statisticaZ techniques for the purpose of summarizing a 
Zarge quantity of data entries and variahZes, The anaZytic 
design required the computation of exact distributions of factor 



scores; two computationaZ techniques were necessary to construct 
the scores. The psychometric basis for one of these techniques 
depended on work which C, W, Hams compZeted in 1962, Hariris ^ 
work extended a basic mathematicaZ formuZation of L, Guttman 
which was pubZished in 1953, The appZication of these statistica 
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procedures to the study of school districts, as described in 
this report, is believed by the authors to be the first such 
use in educational research. 
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Section B - Objectives: The Desired Outputs of the 
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PART I 



SECTION A 

RATIONALE: THE NEED FOR A MULTIVARIATE STRATIFICATION SCHEME 

The Local Education Agency (LEA or school district) is a focus of study or a 

sampling unit in numerous and varied educational research investigations. There are 

s ^ 

two general reasons for researchers being concerned with the LEA. 

One reason is that district sampling may be a necessary step in an hierarchial 
sampling plan, the final target of which is schools or persons. Because of administrative 
considerations, such a plan requires that a district (or several districts) must first be se- 
lected in order to clear channels for gaining research access and cooperation. This is 
especially true if district personnel are to be active in the research process (e.g. 

Ryans, 1960, or Carter and Sutloff, 1960). 



see 



A second reason involves the direct study of district characteristics . Occasion- 
ally differences among districts are stated to be an important factor, or major source of 
variation. A preliminary goal of some such research is to describe district characteris- 
tics in a limited empirical fashion. Some studies have pursued this in depth, endeavor- 
ing to define or discriminate among "types" of districts using measures based on some 
broadly conceived criterion variables. One of the more important and popular kinds of 
criterion variables is the adaptability of various types of school systems — often defined 
in terms of organizational tronsformation or of the adoption of educational innovations. 

An early and well-known study which endeavored to discriminate among types of dis- 
tricts on an adaptability criterion variable was executed by Mort and Cornell (1941); 
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Reported investigations are distributed throughout much of the educational research 
literature; two sources in which they are frequently cited are particular issues of the 
Review of Educational Research, entitled "Educational Organization, AdmihistjOtion 
and Finance" (Uctober, IV6I and October, 1964). 
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a more recent stud/ with a similar purpose has been reported by Carlson (1965). 

But a review of relevant literature indicates that approaches to sampling or 
studying school districts in educational research can be generally characterized as un~ 
systematic or otherwise inadequate (see Cornell, 1960). The problem confronted by 
this project was to develop an approach to the characterization and sampling of LEAs 
which would overcome or circumvent certain typical methodological pitfalls. A syste- 
matic methodology for studying LEAs is becoming increasingly necessary because LEAs 
are becoming increasingly important; the local district has become more powerful as the 
organization through which educational policies and techniques are developed and 
channeled. Also, the development of a systematic methodology is now feasible be- 
cause of progress in the applications of two research tools: one, high-speed electronic 
computing machinery and associated data banks and, two, new multivariate data 
analysis techniques. 

t 

In the remainder of this section, some common research practices are de- 
scribed and exemplified and inferences are drawn which provide a background for dis- 
cussing the particular multivariate approach developed in Part II and demonstrated in 
Part IN. Section B of this Part is given to a description of project objectives, and 
Section C provides an overview of the development and use of the methodological 
algorithm. 

This section has been titled, "Rationale; The Need fora Multivariate 
Stratification Scheme". Establishing such a need has two facets; first, the preceeding 
paragraphs have indicated that a systematic methodology for studying LEAs is important 
in educational research; second, the remainder of the section will indicate that such a 
system is not found in the reported literature. But the concern in the review of litera- 
ture will not be for "criticisms", in the negative sense of the word. Rather an attempt 
is made to educe and evolve from the literature criterio Tor development of a methodology 














4 

Inter-district variation is not always an acknowledged source of variance in 
educational research, but it is an implicit one in any study which treats districts as 
sampling units cr observational entities. For instance, many studies collect Ond process 
data from several districts to ensure "representativeness", but their reports do not in- 
clude information about the dimensions used to sample districts— if, indeed, dimensions 
wo(o used— or about the, variation of the criterion characteristics among the districts. 
Typical of sampling descriptions given in such reports is the statement that "Figures 
below are based on the endorsements given to variations of attitude by fifty northeastern 
New Jersey secondary school principals" (Berthold, 195]), or "The answers to thes<i 
questionnaires provide basic information concerning 433 parochial elementary schools 
in 29 states. . . Teachers reported on 41 different schools while they attended the regular 
summer session. There is no reason to believe that this is a skewed sample of parochial 
school teachers, or that these women are biased in a way that other similar teachers 
would not be" (Fichter, 1958)'. When reviewing the literature on the social backgrounds 
of teachers, Charters (1963, p. 771) concluded that "it is impossible. .. .to compare ths 
results of one study with another. '• 
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Whether s/stematic individual differences among districts affect research 
results exp licitly or implicit/, the results of studies which involve the school district as 
Q sampling unit are typically difficult to generalize beyond their particular contexts. 
Also, it is usually difficult to compare results across such studies. Such limitations are 
related to three general classes of research manipulations: 1) describing school districts, 
2) comparing results of separate studies, and 3) selecting dimensions with which to dis- 
criminate among districts. These are three distinct but related methodological issues, 
and they are respectively related to theoretical concerns for 1) sufficient descriptive 
complexity, 2) replicability, and 3) generalizability. These three points ore given here 
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not only os criticisms of typical research, but also as criteria for the stratification 
system developed in this study and specifically described in Parts II and III. U is there- 
fore necessary to explicate the importance of each point. The explication is given in 
the remainder of this section and a summary is presented in Table 1 . 

SUFFICIENT DESCRIPTIVE COMPLEXITY 

Dimensions typically used for describing and discriminating among districts 
are inadecjuatej research studies are limited by definitions of descriptive dimensions, 
and by oversimplified district characterizations. If the study of school districts is to be 
scientific in any sense, then an empirically rigorous taxonomic system is required for 
describing and differentiating districts. An adequate class'ficatory system is no less 
necessary in the study of institutions than in any other field of scientific inquiry: 

Classification is one of the fundamental concerns of science. 

Facts and objects mUst be arranged in an orderly fashion before 
their unifying principles can be discovered and used as the basis for 
prediction. Many phenomena occur in such variety and profusion 
that unless some system is created among them they would be un- 
likely to provide* any useful information (Sokol, 1966). 

The act of making discriminations among districts~even if the distinctions 
ore only dichotomous-—necessitates the definition of a dimension of discrimination. In 
the process of iaentifying, labelling, and working with such a dimension, the researcher 
is frequently faced with a dilemma. On the one hand, a researcher might choose a 
replicable and operationally sound dimension which oversimplifies the differentiations 
among districts, with a corresponding loss of meaningful ness. On the other hand, if 
he wishes his procedures to reflect more accurately the complexities of district charac- 
teristics, he finds it difficult to define dimensions which are operational and manipulable. 
For instance, an investigator may feel that "district size" is an important distinction 
to make among respondents in his study. Though enrollment totals are typically employed 




TABLE 1 



s. 



Summary of Methodological Issues 



Issue 


Hazard 


Criterion 


1 • OescripHon 


Insufficient characterization of 
LEAs and failure to utilize rele- 
vant information, resulting in 
oversimplification and disregard 
for the complex structure of 
these organizational entitles 


Sufficient DescrifMve Complexity 


2. Comparison 


Ad hoc categorization of LEAs 
oTTd Ill-defined and un reported 
selection procedures resulting 
in the indbility to test and ac- 
cumulate knowledge. 


Replicability 


3. Selection 


A posteriori classification of 
LEAs, resulting in the confound- 
ing of district differences with 
other dimensions peculiar to the 
particular sample. 


Genera lizab ill ty 
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as sfrafificafion variables in such studies (for instance, see Ryans, 1960; Schunert, 

1951; or Barnes, 1961), the investigator may feel that other indices of size— such as 

the number of teachers employed in the district, or the number and capacity of school 

buildings— are also important indicators. There would be no problem in such a case if 

the various Indicators of "district size" were perfectly correlated, but thb is infrequently 
tru^ . '•* 

A criterion, then, for procedures which characterize, categorize, t)r stratify 
school districts Is that they incorporate sufficient descriptive complexity and Thereby 
acknovdedge the multivariate nature of school district characteristics. ^ 

REPLICABILITY 

Procedures for categorizing districts are often project-unique. When researchers 
need to establish procedures foi‘ discriminating among districts, they often rely upon ad 
lioc rational schemes for district characterization. Such schemes are often based on the 
nature of the particular content being researched, and on the investigator's familiarity 
with the districts in which he works; consequently, the assignment of districts to strata, 
types, or clusters is far too often based on the use of impressionistic obseivations. 

An example of a selection scheme which would be very difficult to replicate may 
be found in Pierce's (1947) study of factors related to adaptability. His sample of dis- 
tricts was recruited from the "Metropolitan School Study Council", which is 

* • *L° L of relatively wealthy communities located, 

with the exception of a small number . . ., within the metropolitan 

including eight communities from within 
the city ... It was not possible in the final analysis to include 
every community in each phase of the study due to incomplete 
data on some of the measures. Conclusions and statistical annlvcAc 
are based for the most part on data from 48 communities. ' 
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It seems appropriate to point out the richness of this 
setting for a study of this kind. Probably no other group could 
afford a better background for the study of factors which may be 
related to making good schools . . » 

Clearly, the selection of the "sample" for this study was determined more by con- 
venience than by an a priori proscribed rationale. It is also clear that any investigator 
v/h 9 wished to replicate^the study would have to seek out those same 48 communities 
and their districts; he has no basis for defining an independent, but somehow comparable, 
sample of districts or communities. 



Brur.r er (1957) later wrote that Pierce had 

. . . shown most elaborately through statistical techniques that 
what a school is is determined far more by what the community 
is than by what goes on within the four walls of the school 
building (p. 79). 

Such a conclusion seems hasty,, in view of the ambiguity of the school selection procedure 

» 

used in the study, and in view of the fact that Pierce did not directly compare the impact 
of community characteristics with that of "internal" school characteristics. 



A study by Duncan and Kreitlow 0954) is another example of one which would 
be difficult to replicate because the manner of defining the sample is unclear. In this 
study. 



. .. data were obtained by personal interview in 38 rural neighborhoods 
located in . . . [various parts 1. . .of Wisconsin. . . The neighborhoods 
were selected so as to constitute 19 matched pairs, one in each pair 
being homogeneous in ethnic and religious characteristics and the other 
heterogeneous in these respects. The two neighborhoods in each pair 
were matched on. . . [socio-demographic] characteristics. The 19 pairs 
represent a range of agricultural land types, types of school system and 
specific major ethnic-religious groups in the state (p. 350). 

If another investigator wished to replicate this study in a different location, he would 

need much more information about the identification and classification of "neighborhoods" 

than that provided by these authors, such as: What are the characteristics of the popu- 
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lotion of neighborhoods from which the 38 were selected? Exoctly whot wos the 
selection procedure used in identifying the 38? Whot ore the operotionol definitions 
of ond the criterion cut-offs for "homogeneity" ond "heterogeneity"? Whot wos the 
Attrition rote; thot is, how mony neighborhoods had to be excluded from the study be- 
cause they didn't 'match up' properly? 

In o study such os Duncan and Kreitlow's, o more attractive alternative to 
the sampling problem is to; 1) identify o population of 'neighborhoods', 2) develop a 
theory-based but operationally-defined scheme for classifying neighborhoods os 
'homogeneous' or 'heterogeneous', 3) partition the entire population into these categories, 
4) draw random samples of neighborhoods from each of these two clusters, and 5) use 
demographic variables as covariates, if statistical control o-'er them is desired. 

Although it is usually methodologically sound for the categorization or stratifi- 
cation of districts to be based upon variables which have apparent theoretical relevance 
to the content of the research, it is not desiiable for these procedures to be largely 
determined by the idiosyncratic aspects of individual studies, for the cumulation of 
research findings is attenuated or prohibited. The criterion of replicability, then, fora 
general system which characterizes, categorizes, and stratifies school districts is that 
the system allryws the cross-study comparison and integration of results which ore not now 
possible. That is, the apphcotions of the system should be replicable. Of course, the 
system must permit the selection of dimensions (within the system) considered relevant 
for particular research interests. 

GENERALIZABILITY 

Prior selection of districts to be studied often determines dimensions for cate- 
gorizing districts. For purposes of this discussion, generalizcbility is equated with the 
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avoidance of bias due fo sample selection procedures. One type of biasing condition 
consists of identifying a few (sometimes only two) districts which will grant access to 
do research, executing the research and finding differences between or among districts, 
and then comparing districts on the basis of descriptive characteristics in an effort to 
find some hindsightful way to explain the results. 



A recent study of problem solving among elementary school teachers (Turner, 
1964) describes a procedure for selecting school districts which may impose stringent 
limits on the general izab i I ity of the outcomes. Districts which might have cooperated 
in the research were selected on an "invitational" basis. Al first, 25 Indiana systems 
were invited to participate in the research; twelve of these either did not accept the 
invitation or weie excluded because they did not employ any inexperienced teachers- 
who were to be the primary respondents for the research. Other districts had to be dis- 
carded from some subsequent analyses because their few beginning teachers did not stay 

I 

for the two-year duration of the study. The basis for selecting the districts to whom the 
original 25 invitations were sent is not made clear in the report. After the study was well 
under way, the investigators developed a method for "typing" the 13 participating dis- 
tricts, based on equalized property valuation, and the ratio of working-class to middle- 
class children in a district. The purpose of this phase of the project was to djtermine 
the relationship between the "type" of district and selected characteristics of teachers 
in the districts. 



Inferences about such relationships might have been stronger had the investi- 
gators first identified a population of districts and "typed" each of them, and then 
selected districts from types (preferabi y at random) until they had secured enough LEAs 
which employed first-year teachers. It would remain to persuade the sampled districts 
to cooperate. As it happened, the criteria of having the right kind of teachers and being 
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willing to cooperate were applied prior to the sampling of districts, and the "typology" 
procedure was thus applied to a small and probably biased sub-set of the population of 
districts. G;tib-group samples are not likely to be representative with respect to 
measured characteristics, because "grabability" is doubtless correlated with other im- 
portant qualities which differentiate districts. 

Limitations on generalizability also occur when cases from a population of dis- 
tricts are eliminated because data are inconvenient. For example, study by Terrien and 
Mills (1955) contains a sequence of eleminations of districts which certainly biased the 
outcomes of the research,but in some unspecified way: 

Of the 1747 elementary districts in California, data were secured on 
732: Because of the fact that in organizations of less than ten persons 
one person does several |obs of both an administrative and non -admini- 
strative character, it seemed Justifiable to remove from consideration 
those 468 elementary districts in which the total organization numbered 
less than ten .. Of thfe 245 high school districts, data were secured on 
100. . . four districts were less than ten persons in size, and these were 
removed .... 

Such a practice would be less serious if relevant differences between the "eliminated" 
group and the "retained" group could be described. However, such information is not 
made available; in fact, the possibility of bias by differential elimination cannot be 
detected because selection and elimination procedures are not fully discussed. 



Such limitations are serious because dependent variables are frequently correlated 
with relative availability of data . Almost always, some differences can be found among 
districts which are correlated with and appear to explain the results of the research. Such 
policies are variously known as “taking advantage of chdnce", "fishing in polluted data", 
"exploitation of the grab group", or "a posteriori hypothesizing" . By any name, the 



practice violotes a host of principles of sampling procedures and scientific and statistical 
« 

logic. 
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A criterion/ then, fora system which characterizes, categorizes, and 
stratifies school districts is that the application of the system be unbiased and general iz- 
able. The advantage of such a system is that its availability might encourage analytic 
procedures more sound than the search for descriptive schema which will "discriminate" 
among a few districts selected for study because of their proximity, or willingness or 
ability to cooperate. 



\ 






PART I 



SECTION B 

OBJECTIVES: THE DESIRED OUTPUTS OF THE PROJECT 

The goal of the research described in this report was to provide for the 
empirical characterization and categorization of a defined population of school 
<iist ricts. 
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The two dominant objectives were therefore to develop one general system 
for characterization and stratification of LEAs and to illustrate the utility and appli- 
cability of the system. The system was conceived and formulated as a general 
methodological algorithm for measuring, stratifying, and sampling districts from 
a population. Because the investigators considered districts to have multivariate 
complexity, multivariate analyses have been the bases of the methodology. The basic 
outputs of the algorithm, the’refore, are multivariate descriptions and measures of 
districts and multivariate sampling paradigms. 

Three kinds of criteria shaped the development of the algorithm: A) Funda- 
mental Scientific Criteria, B) Research Utilization Criteria, and C) Practicality 
Criteria. Detailed discussion of these three kinds of criteria are given in this section, 
and a summary is given in Table 2. The derived algorithm is operationally described 
in Part 1 1 , and illustrations and applications are given in Part I II . 

FUNDAMENTAL SCIENTIFIC CRITERIA 

The scientific criteria have already been discussed in Section I ,A. They 
are a) sufficient descriptive complexity, b) replicability, and c) genera I izability. 



The definition for algorithm in this report will be that given in the Random House 
Dictionary of the English Language: "any particular procedure for solving a certain 
type of problem", where, in this case, the problem is the empirical stratification 
of a population of school districts. 



TABLE 2 



Kind 



A. Scientific Criteria 



Summary of Criteria 



Criterion 



1 . The dimensions should reflect the multidimensional 
complexity of districts. 

2. Applications of the algorithm should be repli- 
cable. 

3. Outcomes of those applications should be 
generalizable and unbiased. 



B. Utilization Criteria 



i 



4. The algorithm should allow precise and concise 
descriptions of districts. 

5. The algorithm should allow comparison and cate- 
gorization of districts. 

6. Analytic comparison with outside variables should 
be possible. 

7. The algorithm should facilitate sophisticated i 

sampling for a wide range of studies concerned j 

with school district variability . 



C. Practicality Criteria 



8. System data inputs should be extant and im- 
mediately available. 

9. System outputs should be meaningful and directly 
useable. 

10. Special system outputs should be obtainable for 
particular research interests. 



£. 
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In Section I.A they were stated to be guiding principles for the design of the 

algorithm* Here^ further comments will be made concerning certain practical 
of the criteria. 



aspects 



Sufficient Descriptive Complexity . School districts are political, geographical, 
administrative, sociological entities. Within practical limitations, the array of input 
information should be selected and prepared to reflect and indicate as many aspects as 
possible of school district characteristics. And in summarizing and compositing the in- 
put information, care should be taken to avoid oversimplification. 

Replicability . To make comparitive and longitudinal studies possible, it is 

desirable to be able to update the algorithm periodically. For input availability and 
structure and the composition of the population change across time. Also, it should be 
possible to transplant the algorithm to other populations where similar inputs are avnil- 
able. Then, for example, studies made in Wisconsin could be compared to studies 
made elsewhere. 












•I 



Generalizability . The algorithm should be comprehensive. That is, all the 
units in the target population (for this study, all the elementary school districts in 
Wisconsin) should be included. And all dimensions arising from the algorithm should 
apply to all the units in an unbiased way. 



RESEARCH UTILITY CRITERIA 



The use of multivariate methods in the exhaustive application of a comprehensive 
paradigm to a population of districts should yield dimensions which have utility for 
these general research processes; description, comparison, logical analysis, and 
sampling. In the following paragraphs these processes are briefly discussed: Their re- 
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qulremenh are stated as criteria in the development of an algorithm; and, as illustra- 
tions, previews ere given of the ways in which the present algorithm meets the criteria. 

Descr ip tion . An algorithm should uncover meaningful dimensions on which the 
units vary, dimensions which are intrinsically valuable for developing and verifying 
theoretical notions on the nature of school districts. In the present Jtudy, the wide range 
of input information was reduced to a small number of independent and meaningful 
dimensions. 

Comparison. It should be possible to partition the population of districts into 
clusters such that the clusters are homogeneous internally and heterogeneous with respect 
to one another. One clustering specified by the present algorithm is described in 
Section III.B. Furthermore, the present algorithm allows specifying new clusterings based 
on any selection of the characterizing dimensions. Qualitative and Quantitative comparison 
can then be made between clusters or "types" of districts. 

Logical Analysis . It should be possible to analyze the dimensions derived with- 
in the algorithm as possible sources of variation in understanding and explaining the dis- 
tributions of other variables across the districts. Deming (1953) makes a useful distinction 
between analytic and enumerative uses of research data; "In the analytic problem, the 
action is to be directed at the underlying causes that have made the frequencies of the 
various classes of the population what they are, in order to govern the frequencies of 
these classes in time to come." Cornell (1954) demonstrates that most educators are in- 
terested in analysis. The present algorithm can be utilized to examine relationships 
between the complex, derived dimensions for characterizing and differentiating dis- 
tricts ( independent variables) and other dimensions which have intrinsic value for 
particular research interest (dependent variables). This utility, of course, will depend 

on the meaning or interpretability of the derived dimensions and on their theoretical | 

. I 

relevance to the dependent variables. i ! 

, i 
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Sofnpling TschniguGS* The olgorithm should fdcliltcits sophlsticofed sompllng 
for o wide range of research opplicorions in which school district variability is of con- 
cern. This concern may be for the need to define an especially efficient or especially 
generalizable sample. Or this concern may be for defining a sample which is highly 

variable. The present algorithm allows the selection of stratified samples which meet 
these needs/ as iilustrdted in Section ill.D. 

PRACTICALITY CRITERIA 

While methodological criteria are concerned with the various kinds of research 
goals for which an algorhfim should be useful, criteria of practicality or feasibility are 
related to the probability that the system will be implemented. If the establishment of 
the system is extraordinarily cumbersome, involved, and tedious, there is little chance 
that it will be implemented, no matter how imposing is its justification on theoretical 
or methodological bases. Because of the availability of automatic computing machinery, 
implementation need not be prevented by the necessity of massive amounts of purely 
mechanical operations. But implementation requires information to be available and 
well designed procedures for trearing it to be designed. Three practicality criteria, 
then, for an algorithm which characterizes, categorizes, and stratifies school districts 
are: a) availability of inputs, b) meaningful ness of outputs, and c) variety of outputs. 
These criteria are explained in the paragraphs which follow: they are at first 
stated negatively, as possible obstacles to the development of an algorithm; then, as 

illustrations, previews are given of the practical solutions possible within the present 
algorithm. 

Avoilobility of Inputs. There would be lirrie advantage in developing an 
olgorithm which requires the planning and executing of a special large-scale data 
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collection venture solely for the purpose of stratifying ^ The data for the present 
scheme ore collected and filed in a data library which is at present sparingly used 
for any research work. The set of inputs is comprised of this data and certain pro- 
grammed transformations of it. 



Meaningful ness of Outputs, Multivariate analyses sometimes produce com- 
\ \ 

posites which are highly efficient representations of original data but which are very 
difficult to interpret. If a multivariate system of data reduction yields essentially un- 
interpretable results, it is very unlikely that it will be employed as a methodological 

U 

tool for field research. In the present study, two different multivariate techniques 

were tried, and the one which produced more interpretable dimensions v/as selected. 

Also, the computer program which derives the stratification clusters was designed to 

produce summary characteristics of the districts in each cluster; thus stratifications are 

» 

not presented as a bare struc rre but rather as annptated typology. 



Variety of Outputs , No single stratification, or set of dimensions, is suffici- 
ent for use in all research problems. The present algorithm allows researchers to select 
from a variety of dimensions those considered relevant to their particular interests. 

An illustration of such selection is given in Section III.C. 
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SECTl ON C 

OVERVIEW: THE ALGORITHM AND ITS EMPIRICAL PERSPECTIVE 

An algorithm for characterizing and categorizing the population of Wisconsin 
el ementary school distr lets has been designed and performed. It was inspired by the needs 

discu'hsed in Section 1 .A and its development was guided by the criteria and objectives 

j'' 

' stated in Section 1 .B. The algorithm is a series of operations, and these operations are 

exhaustive! y specifi ed in Pdrt II. In Part III the appi ications of the outputs of the al- 
gorithm are presented. I n this section an overview of the algorithm is given and is followed 
by a perspective on the design, operation, interpretation, and evaluation of the algorithm. 

) 

OUTLINE OF THE ALGORITHM 

The general algorithm’is summarized in Table 3. It has four major components and 

j t 

each component has a list of associated operations. The summary outlines the series of 
operations which have been performed with respect to the population of Wisconsin elementary 
school districts. The summary is general: a complete specification of the algorithm is pre- 
sented in Part II . In the paragraphs which follow, some comments are made about the major 
components of the algor ithm, and references are made to the relevant sections of Part 1 1 . 



Determine and Manipulate I nput Array . The algorithm does not involve e special 
data generation program, for jse is made of banks of data created and maintained for other 

« purposes, such as accounting. Therefore, when the population is defined, the next step is 

to search out and identify what data is available and to organize and collate it for the pur- 
pose of determining what can be extracted from it. The process of extraction requires in- 
genuity in manipulation and is guided by substantive theorizing, indicator variables result 

J * 

j from the extraction. For the present study, the results of the search for data and the specifi- 

cation of the indicator variables constructed are presented in Section II. A. 
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TABLE 3 

SUMMARY OF THE ALGORITHM 



Components of the Algorithm 



Operations within the Components 




<■ 



define population 
identify available data 
collate data files 
construct indicator variables 



^ 



investigate indicator distributions 
transform indicator array 
determine indicator interrelations 
adjust indicator array 







determine, indicator correlations 
investigate factor structure 
select factor model 
compute factor measures 







interpret factor structure 

investigate distributions of factor measures 

form primary multivariate stratification 

study primary strata 

design analytic applications 

devise sampling schemes 



me 
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Investigate and Regularize Indicator Variables. The distributions and the inter- 
relations of the indicators must be displayed and checked. The later operations require com< 
plete data and at least on approximation to normally distributed, linearly interrelated in- 
dicators. Thus transformation and adjustment of the indicator variables may be necessary. 
For the present study, some values of some indicators were missing—that is, unavailable 
or unWputable. The investigation and resolution of this problem is discussed in Section 



II.B. 



Construct Composite Measures. The variables derived from the source data ore 
conceived of as indicator of the important dimensions along which the organizational units 
vary. So when the indicator array has been produced and regularized, the algorithm calls 
for a investigation of the multivariate structure of the indicators. From the indicator correla- 
tion matrix, a number of multivariate models are calculated. The most meaningful and 
parsimonious is selected, and the measures corresponding to the independent, uncorrelated 
factors of this model are ccmputea. That is, from the many indicator variables, a few im- 
portant composite measures ore derived. The techniques employed are discussed in Section 



I.C. 



] 



De sign, Prepare, and Test Uti l ization Scheme r.. The output of the previous com- 
ponent operations of the algorithm provide the means for aiding four basic kinds of research 
processes. One, the data's structure within the selected multivariate model and the dis- 
tributions of the associated measures allow description of the characteristics of the popu- 
lation. Two, by transforming and crossclassifying the measures, a multivariate stratification 
of the units is obtained; thus the units and types of units may be compared. Three, the 
measures or transformed vemions of them may be combined with outside variables in the pro- 
cess of logical analysis. Four, stratifications of the units derived from various crossclassifi- 
cations of the meosures determine sompling plans. The programs and techniques for occessing 
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and arranging the output measures in the ways required for the realization of these schemes 
are discussed in Section II, and illustrations of their performance are given in Part III* 

EMPIRICAL PERSPECTIVE 

The various criteria* stated in Section I.B were ideals toward which the investigators 
aimed the proiect. The algorithm produced is a single and somewhat restricted realization 
of the ideals. The investigators' approach has intentionally been methodological and pragmatic. 
An alternative stylistic approach would have been substantive. It might have involved, for 
example, a more thorough logical analysis of input variables and an exhaustive theory of 
the organizational nature of school districts. Knowledge of individual differences among 
school districts is limited, and because the substantive approach requires prior systematic 
knowledge, the investigators felt their approach should be methodological. 

The specification and resolution of methodological issues were performed, however, 
within a substantive perspective. That is, the decision process in the development of the 
algorithm was assisted and guided by the investigators ultimate concern for providing sub- 
stantively useful results. In the paragraphs which follow, notes are made about the prag- 
matic resolutions that were made at five crucial decision points in rfie project; A) selection 
, of input, B) selection of population, C) selection of analysis, D) definition of stratification. 

I 

These notes provide a perspective for how the decisions were made and for how the decisions 
should be evaluated. 

Selection of Input . The main criteria for input was that it be already gathered, 
that it be easily retrieved and manipulated, and that it reflect a variety of district charac- 
teristics. The concern for available and manipulable data was implied by the pr igmatic 
orientation of the p.oject. But the data obtained were not used directly: elaborate trans- 
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formations were performed. The data obtained were enumerative and, for the purposes 
of the project, were not suitable as direct input for analyses. Since even the most 
sophisticated analyses cannot transform trivial observations into useful generalizations, 
some structure had to be imposed on the raw information. 



The decisions leading to the definition and construction of variables were 



\ lalaa a 

made from a substantive atid ibeoretic perspective, within the limits of project resources. 



The meaning of the algorithm's outputs and applications must be evaluated in terms of 
the viability of these decisions, as they were limited by the pragmatic attitude toward 
the collection of data. One of the purposes of the illustrations in Part IN is to demonstrate 
the substantive validity of the outputs. 



Selection of Population . The population of school districts in Wisconsin is 
really the sum of three populatipns: elementary districts, secondary districts and elementary/ 
secondary districts. Variables have differential meanings when applied to the three popu- 
lations. Consider a variable, the student/staff ratio in a district. For an elementary district 
the ratio concerns, usually, teachers in self-contained classrooms; for secondary districts, 
the ratio concerns departmentalized teachers; and for elementary/secondary districts, the 
ratio concerns a mixture of the two kinds of teachers. That is, what is conceived as a 
single variable derived for the total population is really three separate variables when the 
composing populations are considered. 



A substantive research style would have led to a careful redefinition of such 
a variable, or to an initial partitioning of the population. The pragmatic style of this pro- 
ject led the investigators to find a partial solution to the problem: namely, the dis- 
tricts with secondary schools only (there were about 50) were excluded from the research. 
The defined population consisted then of 632 districts which operated elementary schools, 
and the variables which were selected for the study dealt primarily with the qualities of 
the elementary schools. Even so, the results of the algorithm were slightly skewed'by the 
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continuing presence of an elementary and elementary/secondary distinction. This is 
discussed further in Section III .A. 



Selection of Analysis . As was mentioned above, the input spectrum was 
limited; the variables that could be constructed formed a small, biased sample of the 
set of variables that theoretically and substantively might have been desired. This 
limitation on the array of variables implied the necessity of emphasizing the data-analytic 
processes of the analysis rather than the stotistical or hypothesis-testing possibilities. The 
multivariate techniques used were chosen to provide reduction of the data in hand and to 
impose structure on it. The choice between alternative data analyses, however, was 
dictated by the meaningful ness of their outputs. 



Selection of Galculations. The calculations of the descriptions are the product 
of a certain set of variables, a certain population, and a certain mode of analysis. But 
the outputs of the algorithm characterize the districts only at a certain point in time. 
Because the characteristics of individual districts change over time, the analyses should be 
repeated periodically. For example, an important educational trend in Wisconsin is the 
increasing consolidation of local education agencies, and this tendency certainly affects 
the organizational characteristics of districts. From a substantive perspective, then, the 
selection of calculations for the present study is temporally limited. 



Definition of Stratification . Multivariate procedures were considered to be 
appropriate because the substantive perspective recognized the multivariate complexity 
of local education agencies. The interaction of the use of multivariate procedures with 
the objective of stratification has resulted in a multivariate stratification scheme. A 
“stratification" of a population typically amounts to partitioning the population into 
mutually exclusive groups by defining (often arbitrary) "cutting points" on a continuous 
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stratifying variable. Essentially, this results in a small set of ranks, and each member of 
a sample or a population is assigned one of these ranks. An example would be the socio- 
economic stratification of families in a midwest community according to family income. 
Each family would be assigned one of these ranks. 



Such a univariate stratification is inadequate for use in the general algorithm, 

s. ^ 

First, muny imporrant aspects of the entities would not be measured by that single stratifyin^ 
variable. Second, the generality of applicability of a particular univariate stratification 
is limited. A desirable alternative to univariate stratification is to use several carefully 
composed variables in stratifying. The stratification developed in this study is defined in 
Section II. D and demonstrated in Section III.B. It is a multivariate stratification, and did 
indeed use several (five) carefully composed variables to partition the population of districts 
into mutually exclusive classes. A "stratum" or "cluster", in the multivariate sense, is 
located with respect to many variables, instead of just one variable. Therefore, it is not 
possible to say that a stratum in a multivariate system is genotypically "higher". or "lower" 
than some other stratum. A multivariate stratum is a position in a typology rather than in 
a hierarchy. 
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PART II: DATA PROCESSING AND STATISTICAL TECHNIQUES 



Section A " Variables: Formation, Definition and Codification 

% 

Section B - Missingness: I nvestigat ion and Resolution of 

Missing Data 

Section C “ Factorization: Reducing and Orthogonal i zing 

the Variables 

Section D •“ Data Processing: Techniques of Computing Scores 

and Coding Districts 
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PART II 
SECTION A 

VARIABLES: FORAAATION, DEFINITION AND CODIFICATION 
STRUCTURE AND SOURCES .OF DATA 



Hidrarchy of Data. The Immediata analytic requlremeni’ was to obtain a set 
of measures for each school district. Data were available from three distinct levels: 

(1) school district, (2) school, and (3) employee. The array of data formed a hierarchy, 
since the schools were within districts, and the employees were within schools. 



In computing measures on a district, the information used could be district data, 
distributional characteristics of schools, or employee data. In this section exact data 
available at each level will be discussed, the classes of variables are indicated, and con- 
structed variables are listed. 

Available Data and Their Sources. There were three files of information. The 
first was the Wisconsin State Department of Public Instruction (WSDPI) "District/School 
Tape", which had a record for each school district and a record for each school. The second 
was the WSDPI "Employee Tape", which had a record for each school employee. The third 
file was the "Valuation Deck", which contained a card for each school district; these were 
cards punched from records obtained from WSDPI Division of State Aids and Statistical 
Services. Listed below are data contained in the three files, arranged by the hierarchy of 
district/school/employee . 

For each school district five types of information were available. They are listed 
in Table 4. The fii'st column in this table denotes the five types of information* The first 
three types were coded on the "School/District Tape", and the other two types had been 
punched on the "Valuation Deck". The second column in Table 4 givi^i the coding records 
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used by the WSDPI^ and the third column gives the recoding scheme used to transform these 
data into input for computations. 



TABLE 4 

information Available by School District 



Type of Information 


WSDPI Coding Record 


Recoding 


1 . Kind of administrative 


1. City of Milwaukee 


1 


structure of the 


2. City Unified 


1 


district 


3. City Common 


1 




4. City F. D. 


1 




5. County Unified 


2 




6. County Common 


2 




7. County U. H. S. 


2 


2. Scope of grades 


1. K-12 


1 


taught in the district 


2. 1-12 


1 




3. 9-12 


(Not Used) 




4 . K-8 


2 




• 5. 1-8 


2 


3. Class of state financial 


^ 1. integrated 


1 


aid to the district 


2. Basic with Integrated 


2 




3. Basic 


3 


4. Assessed valuation 


Dollars 


(Not Used) 


5. Equalized valuation 


Dollars 


(Unchanged) 



Note. - Source; Wisconsin SDPI "School/DIstrIct Tape", except ibat Types 4 and 5 were 

taken from the "Valuation Deck". 



For each school two types of Information were available as denoted In column 
one of Table 5. They both were taken from the "School/DIstrIct Tape". The second and 
third columns give the WSDPI coding and recoding schemes. 

. For each employee there were eleven types of Information available. They are 
listed in column one of Table 6. These data were on the "Employee Tape" In coded form as 
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given in column two of the table and recoded according to the scheme given in column three. 



TABLE 5 

Information Avaiiabie by School 



Type pf Information ^ 


WSDPI Coding 


Recoding 


1 . Type of School 


1 . Four-year high school 

2. Six-year high school 

3. Senior high school 

4. Junior high school 

5. Elementary school 


(Not Used) 
(Not Used) 
(Not Used) 
(Not Used) 
(Not Used) 


2. Enrollment 


Counts 


(Unchanged) 



Note. ~ Source: Wisconsin SDPI "School/District Tape" 



In addition to the . 'les given in Tables I, 2 and 3, the three files contained 
information sufficient to locate employees within schools, and schools within districts. 



The data which were available from these three sources formed seven classes of 
variables, and allowed the construction of 31 variables for computational purposes. The 
classes, abbreviated titles, and identification code numbers of these 31 variables are pre- 
sented in Table 7. Complete descriptions of the 31 variables are given in Appendix M. 



The remaining paragraphs of this section describe the procedures for generating the 



variables. 

CONSTRUCTION OF VARIABLES 

Teacher Characteristics. Because the stratification of the school districts was 



intended to be used in studies of district sub-units (e.g. schools, teachers and students), 
it was necessary to include input data derived from characteristics of these sub-units. 

Twelve of the constructed variables deal directly with characteristics of the elementary 

¥ 

teachers in each district. As these variables had to be attributes of districts, they had to 
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Type of Informatio n 
1. Credential 



TABLE 6 

Information Available by Employee 



WSDPI Coding Record 

1 . 1 year license 

2. 2 year license 

3. 3 year license 

4. 2 yeor term certificate 

5. 3 year term certificate 

6. 4 year term certificate 

7. 5 year term certificate 

8. Life certificate 

9. 1 year permit 

10. 1 year special license 



% 



2. Degree 



3. College Conferring Degree 

4. Division of Tirpe 

5. Months Employed 

6. Salary 

7. Local Teaching Experience 

8. Total Teachin;.j Experience 



9. Sex 



10« Position 



30 



11 . Gradfs^i or Subjects Taught 



Recoding 



1 . less than 2 years 

2. 2 years (diploma) 

3. 3 years 

4. Bachelor's 

5 . Master's 

6. 6 years 

7. Doctor's 

8. other 

Name 

1 , Percent Elementary 

9 Por^onf ^Ar^onrlnn/ 

Months 

Dollars 

Months of teaching service in 
the local district 

Months of total teaching 
service in or out of district 

No record 

School staff position codes 
differentiated various types 
of administrators and teachers, 
and non*professional employees. 
WSD^I codes used were: 

2/ Secondary Teachers 
32 Junior High School Teachers 
42 Elementary Teachers 
95 Non-professional employees 



For elementary teachers, the 
range of grades taught is indicated. 
Otherwise, the codes indicate the 
courses taught. 



8 

2 

3 
7 
6 
5 

4 
9 
1 

0 

1 

2 

3 

4 

5 

6 
7 
0 



(Not Used) 

(Used ror Tabulation Only) 

(Not Used) 

Dollars 



(Unchanged) 

(Unchanged) 



(Used for Tabulation Only) 



Grade Span 



Note* - Source; Wisconsin SDPI Preliminary Report Forms and "Employee Tape" . 
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Abbreviated Titles of Recoded Variables 
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Class of Variable 

A. Elementary Teacher Characteristics: 

V 

B. Classifications of Administrative 
Structure: 

C. Enrollment: 

« 

D. Employee Counts: 

E. School Counts by Size: 

F. Valuation: 

G. Ratios: 



Variable 

Ic Mean credential 

2. Mean degree 

3. Mean salary 

4. Mean local experience 

5. Mean total experience 

6. Mean grade spread 

7. Log-variance credential 

8. Log -variance degree 

9. Log-variance salaiy 

10. Log-variance local experience 

11 . Log-variance total experience 

12. Log-variance grade spread 

13. Kind 

14. Scope 

15 . Class 

16. Secondary 

17. Elementary 

18. Full-time elementary teachers 

19. Full-time junior hign teachers 

20. Full-time secondary teachers 
21 Other teachers 

22. Other professional employees 

23 . One-room 

24 . Two-rooms 

25 . Three or more rooms 

26. Equalized valuation 

27. Valuation/student 

28. Students/school 

29. Students/staff 

30. Staff/school 

31. Valuation/school 



I I 

Note. - The number designations for the thirty-one variables as given in column 
two are the codes used throughout the text and for all tables and 
appendices • 
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describe distributions of district characteristics, rather than characteristics of individual 

teachers. Therefore, means and variances of teacher characteristics within districts were 

used as inputs. In the computation of variances, logarithms were used in an attempt to 
eliminafe non-normal distribufions. 

y P rch©d for all records of full-fime elementary teachers 

withih each district. For dach such teacher, six codes were examined: "Credential", 

"Degree", "Salary", “Local Experience", "Total Experience", and "Grades Taught" 

(See Table 3). Six temporary variables were constructed. The first and second of these were 
recoded versions of Credential and Degr^; the records were arranged in order of increasing 
values, based on preference ratings according to WSDPI criteria: the highest ratings corres- 
ponded to the highest numeric codes. The third, fourth, and fifth of the temporary variables 
were copied directly from Sala^, Local Experience, and Total Experience . Salary is in 
dollars; experience is in months. The sixth temporary variable was constructed from the 
"Grades Taught" code. The Grade Spread was determined by counting the number of grade 
levels a teacher was responsible for in his classroom instruction; for example, if a teacher 
taughf Grade? 1-3, then the recoded value was 3. 

The arithmetic means of the six temporary variables were computed across the 
teachers within each district. These district means became Variables 1 to 6. The logarithms 

(base e) of the variances of the six temporary variables were camputed and formed Variables 
7 to 12. These twelve variables are listed in Table 7, 

Wisconsin SDPI Classifications. The WSDPI assigns to each district a code for each 
af three classifications, ^slled "Kind", "Scope", and "Class" (See Table 4). The codes were 
read from the district records on the "District/School Tape" and recoded to produce Variables 
13 to 15.. Variable 13, Kind, is "1" for city-based districts and "2" for county-based dis- 
tricts. Variable 14, ^coge, is "1“ for districts with one or more high schools, and "2" for 
district, with no high schools. Varioble 15, Qua. is coded according to WSDPI criteria for 
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distribuHrig slxate aids. 



Districts receiving the class of integrated state aids were assigned a 



code of *1", those receiving basic with integrated aids were assigned a code of ”2", those 
receiving basic aid were coded "3". 



Note that codes for these three variables are inverted with respect to the preference 

ratings assigned by the WSDPI . Low values of the variables correspond to high WSDPI 
ratings. 



Student E nrollment . The student enrollment of each school was coded in that 
school's record on the "District/School Tape", By summing the enrollments of the schools 
in a district. Variables 16 and 17 were computed. For Variable 16, the sum was taken over 
secondary schools, so that variable is total Secondary Enrollment. For Variable 17, the 

summing was taken over elementary schools, so that variable is designated as Elementary 
Enroll ment. 

f 

Teacher Counts. The records on the "Employee Tape" were examined for all 
employees in a district. Non-professional employees were ignored. The "Division of 
Time" and "Position" codes were considered (See Table 6). From the "Division of Time" 
codes It was determined whether an employee was full-time. From the "Position " codes it 
was determined which of the following positions an employee held: (a) elementary teacher, 
(b) junior high school teacher (c) high school teacher, or (d) non-teacher, usually adminis- 
trator. A teacher conceivably could have any or all of the four positions. 

Variables 18 to 20 are concerned with full-time teachers witliin each district. 
Variable 18 is the count of elementary teachers in a district. Variable 19 js the count of 
full-time junior high school teachers in a district; and Variable 20 is the count within a 
district of full-time high school teachers. None of the teachers counted in these three 
variables had non-teaching poisitions, but if they had more than one teaching position 
they were counted in tlie "higher" classification. For example, c teacher who taught both 
junior and senior high school was counted with senior high school teachen. Variable 20. 



Variables 21 and 22 are counts of the other professional employees; that is, they 
are counts of employees who ore not full-time professionals or who had non-teaching duties. 
Variable 21 is the count of those employees who did some teaching. Variable 22 is the 
count of professional employees who did not teach. 



within a 



— ool , The number of schools was derived by counting the school reconls 

district on the "District/School Tape". No infomsation was directly available on 



the physical size of the school plants. So information indicating school size was estimated 
by counting the number of teachers (on the iTeacher Tape") who were assigned to a school. 
Variable 23 is the count of schools in a district with just one teacher, and is, by inference, 
N»gber of One-Room Schools; Variable 24 is the number of schools in a district with 
exactly two teachers and hence is the Number of Two-Room Schools ; and Variable 25 is 

the count of the remaining schools within a district and is the Number of Three-or-More-Roam 
Schools. I 



The equalized valuation of each district was obtained from the 
"Valuation Deck" and designated as Variable 26. EquaX^ valuation was used rather than 
assessedvaluation, because the assessment formulas vary from district to district. 

Mi°i- Five variables weie-foimed by taking ratios of selected combinations of 
enrollment variables, staff variables, and the district valuation variable. The sum of 
Variables 16 and 17 is the total number of students in a district. The sum of Variables 18 

to 22 is the count of the total professional staff in a district. The sum of Variables 23 to 25 
IS the number of schools in a district. 

i 

Variable 27 is the ratio of valuation to the total number of students; that is,' the 
dollar \^tuation per Student. Variable 28 is the number of Students per School. Variable 29 
is the r, umber of Students per Staff member. Variable 30 is the number of Staff per School . 
Variable 31 is the dollar Valuation per ScFtool« * 



INITIAL DATA MATRIX 









A list of 31 variables has now been described, and their codes have been specified, 
e variables describes a chaiacteristic of elementary school districts and the whole 
set of variables describes seven classes of district aspects. Full descriptions of the 31 

* I I 



variables are provided in Appendix M. The means and standard deviations of these variable 
are given in Appendix B .2,' and their intercorrelations are given in Appendix B .3. 



As indicated in Section I.C, the population of concern in this project was the 

g-oup of all Wisconsin school district which had reported elementary enrollmenh greater 

than zero. There were 632 such districts; 44.5o/o of them consisted of elementaiy schools 

only, and 55 . 50/0 of rt,em were made up of elementary schools and high schools. The initial 

data matrix could not be directly used os input for the computations. The input data for 

the computing algorithm needed to consist of a complete matrix wiri, dimensions 31 x 632; 

the intersection of a row and a cdlumn in this matrix represented the -score- of c particullr 
district on a certain variable . 



One problem had to be solved before the matrix could be satisfoctorily used as 
-nput: there were missing values in foe matrix, it was not possible to obtain directly a 
measure on every variable for every district. This meant that some of the correlations 
between parrs of variables, as given in Appendix B . 3, were based on data from fewer than 



632 districts. Appendix B. 1 indicates which 



correlations were based on subsets of the 



district population; an entry in this appendix gives foe number of districts which had 
missing data for either of the corresponding pair of variables. 



no 



The causes and solution of this missing data problem are folly discussed 
Section II. B. 
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SECTION B 



MISSINGNESS: INVESTIGATION AND RESOLUTION OF MISSING DATA 



The computaHons required by the analytic algorithm, presented in Section II. C, 
were designed for the purpose of achieving the research objectives discussed in ftirt I. 

In order to perform these cqpiputations, it was necessary that the input data matrix be 
complete; that it, it could have no missing entries. But there were missing entries in the 
initial data matrix, and some procedure had to be employed to substitute for missing values. 



Discussions in this section will first consider the general problem of missing data, 
then the procedure of estimating values to replace missing data, and finally the nature of 
the input data matrix which was used for computational purposes. 



THE PROBLEM OF MISSING DATA 



Origins of Missing Data . The first twelve variables CTeacher Characteristics) 
were the only ones for which entries were missing. It has been indicated that Variables 
1 to 12 were measures of the six characteristics of the full-time elementary teachers in a 
district; Variables 1 to 6 were the district means for these characteristics, ar.d Variables 
7 to 12 were the log-variances of those characteristics. There had to be at least one full- 
time elementary teocher in a district before a mean could be defined, and at least two such 
teachers in a district before a log-variance could be defined. 



There were three types of missing data. The first type occurred when a district 
had only one full-time elementary teacher. There were ninety of those districts; for those 
ninety districts Variables 7 to 12, being log-variances, could not be defined. This first 
type of missing data accounted for almost all missir^ values. 
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In rile second type of missing data, a few full-time elementary teachers had 
improper code designations for some of the characteristics. Their codes, then, could not 
be included in computing the means or log-variances of those district characteristics. 

For two of the districts, this resulted in fewer than two codes on which to base particular 
means and log-variances. This second type of missing data accounts for only c few 
missir^ entries for specificiyariables. 

In a certain sense, there was a third type of missing data. When a district had 
two or more full-time elementary teachers, all of whom had the same value for some 
characteristic, the variance of the characteristic was zero. And the logarithm of zero is 
undefined. 



The three types of missing data, their missingness characteristics, and their re- 
placement techniques are summarized in Table 8. A detailed discussion of the replacement 

I 

techniques is given in the remainder of this section. 

Rationale for Replacement of Missing Data . As noted in the preceding paragraphs, 
and in Table , certain kinds of data were missing for three basic reasons. In order to pre- 
pare a proper input data matrix, it was necessary to place values in cells where entries were 
missing in the initial data matrix. It was desired that a rational procedure be used for 
placing values in these empty cells. The substitution of a value for a missing entry is 
rational if the meaning of the substituted value has the same meaning as computed values 
in corresponding cells for other districts; that is, if the substituted value is an indicator for 
the same underlying variable. 

Of the three types of missing data. Type C, Intra-District Equivalence, presented 
the most .trivial and elementary problem for the rational substitution of values in empty cells. 
Consequently, its treatment will be presented first. Thereafter are discussed the replacement 
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procedures for missin9 data Types A and One-Teacher Districts and Ifnproper Code 
Designations. These two types of missingness posed more serious methodological problems, 
and their arguments for replacement are developed in detail • 

I 

TABLE 8 

Types, Characteristics and Replacement 
'' Techniques of Missing Data 



Type 



Characteristic 



Replacement Technique 



A. One-Teacher District 



Only one full-time elementary 
teacher in a district, with 
valid characteristic codes; 
therefore, log-variances were 
not computable for Variables 
7 to 12. 



Regression estimates: 
For districts with no 
missing data, the rela 
tionships among 
Variables 13 to 31 
and 7 to 12 were 
det#>rmined. 



B. Improper Code Designation ‘ More than one full-time ele- Regression Estimates: 

mentary teacher in a district. Same as for A. 

some of which had one or more 

invalid characteristic codes, ^ ^ 

causing the district*s value on 

one of the Variables 1 to 12 ■ 

to be missing. 
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data. Hence, rtie low-ta-high scale characteristic was maintained for the substitutions. 

Consequently, when missing data of Type C was encountered, that is, when a variance 

comtructed for one of Variables 7 ta 12 was zero, the lowest lc«-variance for riiot variable 

which hod been computed for any distrust with a valid entry was substituted. Treated this 

way, the replacement scheme fpr missing dota Type C did not seem to raise any methodological 

or substantive issues. It wds programmed to be carried out automatically without tarther 
invesHgoMon, 



There were more serious difficulties associated with missing data Type A, where 
a district had only one full-time eleme. ny teacher. The same general rationale existed 
for substituting values; that is, the substituted values and the computed values should be in- 
dicators for the same substantive variable. In particular, the substituted values should be 

estimates of the values that would have been obtained for log-variances of teacher charac- 
ferisfics if there had been more than one teacher. 



As there was no way to directly estimate hypothesized vclues for missing data, a 
procedure needed to be developed which would yield rational predicted values for empty 
cells. Of v,e techniques available In the field of statistical application, to prablems of pre- 
diction, regression analysis is considered to be the most justifiable procedure. These con- 
■ siderations implied that the relationships should be found between the variables for which 
there were no missing data and the variables which did exhibit missingness; and that those 
relationships should be used to estimate values for the missing entries. Before these rela- 
tionships were computed, prior investigation was necessary to assure that the relationships 
between Variables 7 to 12 and Variables 13 ta 31 were relatively constant ^er all districh. 
Discussion in the following paragraphs reports these analytic investigations, ^ 
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ESTIMATING VALUES 



Properties of Missing Data . The missing data to be investigated. Types A and B, 
occurred in such patterns that, for purposes of data analysis, missingness could not be con- 
sidered to be random; that is, missingness was not independent of the processes being studied, 
for the' cfiitricts with missing data had few full-time elementary teachers. Having few full- 
time elementary teachers is a characteristic of a district which might be expected to correlate 
with other characteristics, such as size or organization. 









In order to measure the relationships between missingness in Variables to 1 to 12 
(means and variances of teacher characteristics) and Variables 13 to 31 (district characteristics), 
a correlational analysis was performed. AM districts were included in this analysis. There were 
31 attributes: the first 12 were dummy variables constructed from the first 12 original variables, 
and the last 19 were the last 19 original variables. The dummy variables were computed by 

f 

coding *1* for entries which were not missing and *0' for entries v . i were missing. The dummy 
variables may be called "non-missingness" variables. Substituting these dummy voriabies for 
the first twelve original variables resulted in a re-definition of the initial 31 x 632 data 
matrix. In the re-defined matrix. Variables 1 to 12 were dummy variables, and Variables 13 
to 31 were the original variables. Using this re-defined matrix, means, standard deviations, 
and correlations of the variables were calculated and are presented in Appendix A. 

The mean^ of the attributes in Appendix A. 1 a re, considering the method of constructing 
the dummy variables, just the proportions of districts for which the cor»'esponding original 

variables were not missing. Thus Variables 1 and 2 were missing for just a few districts— 

« 

actually, 4 and 3 districts, respectively. Variobies 3 to 6 were nowhere misting. And each 
of Variables 7 to 12 were missing for about 14% of the districts. The standard deviations of 
these dichotomous dummy variables are not useful, and the means and standard deviations of 
the last 19 variables are discussed !ater. ^ 
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InfercorrelaHons of fhe aftributes in fhe re-defined mafrix are presented in Appendix 
A. 2. The intercorrelations of Variables 7 to 12 rrre all approximately equal to 1 #00^ and it 
is clear that almost all the missing data was of die first type; that i$^ most districts had all of 
Variables 7 to 12 missing, or none of Variables 7 to 12 missing. 

From examining the correlations between the dummy variables and original variables, 
it is cledr that missingness is fndeed related to other district characteristics o Districts without 
missing data tended to have high schools ( r = -.46 with Scope) , integrated state aid ( r = -.54 
with Class ), many students per school ( r = .46 with Students per School), and many staff 
members per school ( r = .47 with Staff per Scho ol )» These rather high correlations evoke some 
doubts about the substantive accuracy of assuming that relationships between Variables 13 to 31 
and Variables 1 to 12 were the same for districts with missing data as they were for districts 
without missing data. This condition was observed, but investigations of its seriousness were 

beyond the limits of the project. Though this problem could not be corrected, it was still 

* 

possible to maximize the use of available information. This maximization will be elaborated 
in following paragraphs. 

Maximum Information Correlations . The next step leading to the actual replacement 
of the missing data was to perform another correlation ar<alysis. The purpose of this analysis 
was to provide the best possible estimates of the true intercorrelations of the 31 input variables. 
In this analysis, the information in the original data matiix was used. But since there were 
missing entries in the matrix, standard correlation procedures could not be followed. For each 
coefficient (mean, standard deviation, or correlation) computed, only non-missing data could 

be used; the analysis used maximum information by using all non-missing datr for the compu- 

< 

tations of the coefficients. This resulted in different numbei'S of districts being involved in 
computations of the di.ferent coefficients. The results of the maxim um information analysis 
are presented in Appendix B. 
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In Appendix B.1 is given a matrix of counts of districts. Each entry in the matrix 
corresponds to a pair of original variables, and is the number of districts for which neither 
variable was missing. Each diagonal entry corresponds to a single variable, and is the 
number of districts for which that variable was not missing. As expected, the intercounts 
for Variables 13 to 31 are uniform-ly 632, which is the number of districts in the population. 

This indicates that there were no misslr^ data for Variables 13 to 31 . The intercounts of 
Variables 1 to 6 and the crosscounts of Variables 1 to 6 and 13 to 31 are about constant at 
632: this indicates that there are some missing data of the Type B, improper Code Designations. 
The intercounts of Variables 7 to 12 and the crosscounts of Variables 7 to 12 with the other 
variables are about constant at 542; missing data Type A, One-Teacher Districts in 90 districts. 
There were few mi:Jng data of the Type B in this segment of the matrix. 

The maximum information means and standard devictions are presented in Appendix 
B.2. Each mean and standard deviation is based on all districts for which the variable was not 
missing; that is, it is based on the number of districts indicated in the corresponding, diagonal 
entry of the count matrix of Appendix B. 1 . The maximum information correlations are pre- 
sented in Appendix B.3. Each correlation coefficient is based on all districts for which neither 
correlate was missing; that is, it is based on the number of districts indicated in the corres- 
ponding entry of the count motrix of Appendix B. 1 . 

In a sense, the maximum information coefficients provide a criterion of adequacy for 
the replacement of missing data technique. After substituting values for the missing data, 
changes in the means, standard deviations and correlations should be negligible. Substantial 

changes in these values would indicate systematic bias in the replacement procedure; the 

< 

definitions — that is, the meanings — of the variables would have been changed. Later it will 
bfe indicated that the coefficients were not significantly changed by the replacement operation. 
Since missingness was shown to be related to the district characteristics, some changes in rela- 
tionships among variables would be expected when missing data substitutions were made and 
all 632 districts included in the analysis. 
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Ho /i ig established maxitnutn information coefficients as criteria for the results of 
replacing estimated values for missing data, the operations for computing the substitution 
estimates were undertaken. 



The Substitutions Operation. The generaJ strategy for finding values to substitute 
foi missing data involved two distinct phases: firsts for districts with no missing data, the 

s s 

relationships connecting Variables 13 ta 31 with Variables 1 to 12 were determined; second. 



for districts with missing data, the values of Variables 13 to 31, which were never missing, were 
iranipulated according to the derived relationships. This produced estimates of Variables 
1 to 12 for districts with missing data, which were then substituted for the missing values. 
Multiple regression procedures were used to determine the relationships between variables 



with missing data and variables with no missing data. Twelve linear regression equations were 
required; each of Variobles 1 to 12 was predicted by the set of Variables 13 to 31 . 



The first phase, then, involved finding the twelve regression equations. This 
necessitated performing a correlation analysis. The attributes for the analysis were the 31 
original variables. No values were missing, since the entitites were those districts for which 
no data was missing— there were 539 such districts. The results of the correlation analysis 
are presented in Appendix C. In Appendix C. 1 are the means and standard deviations, and 
in Appendix C.2 are the correlations. It should be noted that there are discrepancies between 
these coefficients and those of Appendix B, where all information is used. This is caused by 
the fact that about 90 districts have been omitted, and those 90 districts are not randomly 
scattered, but rather are all districts with few elementary teachers. For example, the c istricis 
in the reduced sample have, on the average, about 134 more elementary students; that is, 
the average elementary enrollment of the entire population of 632 districts Is 798, whereas the 
average elementaiy enrollment Is only 932 when the 90 single-teacher districts are omitted. 
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The regression equations were determined from the t . jlotions given in Appendix 
C. The standard form of the equations is presented in Appendix D. Each row contains the 
beta coefficients for predicting one of the Variables 1 to 12 from the nineteen Variables 
13 to 31. The beta coefficients are followed by the squared multiple correlation coefficient 
for the equation. The inference is that the regression equations are able to account for from 
13% (Vo(iable 8) to 53% (Variable 2) of the variation of Variables 1 to 12. Although this 

I 

is hardly perfect prediction, a significant amount of the variation is being predicted. If 
the prediction were perfect, then Variables 1 to 12 would be strictly redundant. 

From the regression equations of Appendix D,. end the never-missing values of 
Variables 13 to 31, the regression estimates of Variables 1 to 12 were computed for those 90 
districts that had missing data. These regression estimates were then substituted for missing 
data. 

4 

$ 

The replacement scheme adopted for missing doits Type A, One-Teacher Districts, 
also seemed satisfactory for resolving the problem of missing data Type B, improper Code 
Designations, of which there were only two cases. 

THE INPUT DATA MATRIX 

As a result of the substitution procedures outlined above, the data matrix was com- 
plete: all empty cells had been filled. The remaining step was to determine whether the 
maximum information coefficients of Appendix B had been substantially altered. For this 
purpose, a correlation analysis was performed, with all 632 districts as entities, and the 
original variables as attributes, with reg.ession estimates substituted for missing data. The 
results of this arKilysis are presented in Appendix E. Appendix E,1 contai^'iis the means and 
standard deviations of the variables, and Appendix E.2 contains their correlaHons, 
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' The difference befween (he coefficienfs of Appendix 6 and those of Appendix 
E ore small. In general, differences occur in the second or third decimal place, end they 
occur primarily in the means, standard deviations, and correlations of Variables 7 to 12. 
Any change of importance would have occurred among these variables, because esti- 
mated data was provided for a special subset of the population of districts. As important 
chahges did not occur among these variables, further investigation was not warranted. 
Because of the small differences caused in correlations by using regression estimates to 
replace missing data, the data matrix with these estimates substituted was considered to 
be a satisidetoiy Input Data Motrix, and served as the basis of all subsequent analyses. 



PART II 
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SECTION C 

FACTORIZATION: REDUCING AND ORTHOGONALIZING THE VARIABLES 

The 31 variables of the input data matrix were transformed into a smaller 
number of uncorrelated (orthogonal) variables for stratification. Multivariate tech- 
n^ques were employed in reduction and orthogonalization. In this section is first 
discussed why these operations were necessary. Then the two computing algorithms 
used are presented and discussed. Finally, the reduced variables derived from the 
two algorithms are discussed, interpreted, and compared. 

METHODOLOGICAL ISSUES 

^ The Need for Summarization and Reduction . One of the purposes of this 

study was to obtain a set of variables which could be used for stratifying districts. 

As described in the previous sections of this Port, the original information available 
for this purpose was manipulated to produce the input data matrix of 31 variables. 

For the purpose of district stratification, this data matrix might possibly have been used m 
any one of several ways: 1) ' II 31 input variables might be used as a set of stratifying 
dimensions; 2) a sub‘~set jf the 31 variables might be selected ond used for stratifica- 
tion; or 3) an efficient summarization a .d reduction of all 31 variables might be 
derived. From the methodological viewpoint stated in Piart I, the third of these 
alternatives, summarization, was considered the most rational choice. 

Use of qll of the 31 variables as a set of stratifiers was impractical, since 

these were too many variables for construction of a meaningful or useful stratification 

31 

scheme. For example, if each variable was dichotomized at its median, 2 dis- 
trict-categories would have been defined. Since there were only 632 districts, most 
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of these categories would hove been null, and only a few districts would have been 
grouped together in even the largest category. Thus no efficient categorization, or 
stratification of districts, would have occurred. ^ 

Selection of a sub~set of the 31 input variables would require justification 
for the inclusion or exclusion of each variable. However» each variable is assumed 
to be an important indicator of a district characteristic. All of these important 
characteristics should be somehow included in a stratification scheme; to exclude some 
of these characteristics from the stratification would be to ignore the acknowledged 
multivariate complexity of the districts. Furthermore, as discussed in Part I, the 
selection of a small sub*-set of the 31 variables would be equivalent to the formulation 
of an ad hoc theory for stratification. 

Summarization and reduction of the 31 variables is desirable in order to simplify 

t 

the characterization of districts, and yet maintain sufficient descriptive complexity. 
This procedure implies that no single variable contains sufficient informatron to be 
accepted as a stratifying dimension. Rather, each variable is an "indicator" of an 
underlying dimension. The co~relationships of several indicators, or variables, when 
taken together as a composite would be expected to define an underlying dimension* 

For example, the enrollment variables (Variables 16 and 17) and the number-of- 
employee variables (Variables 18 through 22) might be hypothesized as indicators 
of the same underlyirig factor, say, district size. Thus it was considered that the 
factors hypothesized as underlying the 31 variables would be more fundamental, 

and hopefully more meaningful dimensions than the original variables for the purpose 

» *. 

of stratification* 
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OrtfiogonalizQfion of the Variables* The reduction of the number of vari- \ 

ables was accomplished using multivariate techniques. These techniques allowed 
the mathematical derivation of composite variables which were based on the original j 

31 variables. It was possible to derive composite variables^^hich were orthogonal 
(uncorrelated) to one^another. This orthogonalization was particularly useful in the 
stratification, and also presented statistical advantages in comparing the composite 
variables with other (outside) variables. 



Multivariate Analysis. The need for reduction in the number of variables and 
the advantages of orthogonality implied the use of multivariate analysis/ in particular 
they implied the use of certain techniques generally classified as factor analysis. 

Hams (1955) discusses two possible types of factor analysis models: communal ity and 
non-communality moflels. With general communality-type models, the factors lie 
outside the variable space, and the factor scores are estimatable but not computable* 
But with certain non-communality models, the factors lie within the variable space, 
and factor scores for entities are exactly computable. For this study, the scores for 

r 

the factors derived from the originaT 31 variables were to be used for siratification, 
so it was necessary to use a model wherein the factor scores were computable. There* 
fore only non-communality analytic models were considered. 



The two forms of non-communality analysis considered were Hotelling's 

(1933, 1935) component analysis and Harris' (1962) version of Guttman's (1953) 

image analysis. The concepts of image analysis are based on certain linear trans- 

« 

formations of the original variables; the concepts of component analysis are based 
directly on the original variables. For the present study, it was decided to fry both 
forms of analysis. For each form, the analysis consisted of two operational phases. 
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First, *8 mah-ix to be factored wci defined and its unrotated factors were computed. 
Second, the normal varimax rotation procedure (Kaiser, 1958) was applied to the 
unrotated factors, and a new sot of factors~the rotated factors— was obtained. For 
each form of analysis, the firat phase was ..esigned to bring forward certain theoretic 
qualities of the variables, and the second phase was designed to provide interpretable 



factors. 



analyses 



The analyses for botij the component and image models are presented here, 
together with discussion of their theoretic properties and their substantive interpretations 
for this study. The results from the application of the two models are then compared. 



Component Analysis. The inputs to the component analysis consisted of the 
31 by 632 input data matrix and its correlation matrix R (given in Appendix E.2). The 
first obiective of the component analysis was to find an orthogonal basis for the variable 
space. This meant finding a factor matrix F ^ which corresponded to the correlations 
between the original variables and a set of factor scores . These factors have the 
following properties: 

Cl D distributions of factor scores ore uncorrelated; 

C2] they are linear combinations of the original variables; 

C3] they span the space of the original variables. 

Such an F^ would satisfy the relationship F^ ■ = R. In this study, R was non-singular, 
and there had to be 31 factors in order for [3] to be satisfied. Hotelling (1933, 1935) 
gave a technique for finding an F ^ which also has the property: 

[4] the first factor accounts for a maximum amount of variance in 

the variables; and each succeeding factor occounts for a maximum 

of the remaining variance, all given thot each factor is uncorrelated 
with all the preceeding factors. ’ 



I 
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That F,. IS ealled the principal component factor matrix. Property [4 ]a I lowed as 
much variance to be accounted for in as few variables as possible. Although the 
property was lost in the rotation of the basis, it was possible to see how much com- 
pression of the 31 variobles could be achieved. 

The computation of F ^ proceeded as follows. First a latent root and vector 

resolution of the correlation matrix was obtained. The latent roots are presented in 

Appendix F. 1; they are'considered to be the diagonal entries of a diagonal matrix 

2 

called M . The latent vectors are presented in Appendix F. 2; each column is the 

unit-length latent vector for the corresponding latent root in M^. The matrix of 

latent vectors is called Q . It follows from the definition of latent roots and vectors 

2 

that R — QM Q . Hotelling (1933, 1935) showed that the factor matrix Fj.,that 
corresponds to the factor scores v/hich satisfy properties [1 ] through [4] above^is given 
byF^ = QM. ltcanbeseenlhatF^F^' = (QM)(QM)'=QM^Q' = R. The matrix 
F ^ is given in Appendix F .3. , Each row corresponds to an original variable, and 
each column corresponds to a factor; the factors are arranged in decreasing order of 
variance accounted for. Appearing before each row is the row sum of squares, which 
is always 1 .0 since all the variance in each variable is accounted for by the set of 
factors. Above each column appears the column sum of squares which is the amount of 
variance accounted for by the factor and which is equal to the corresponding latent 
root. The amounts sum to 31, and are additive since the factors are uncorrelr ed. 

For example, the first factor accounts for about 309S of the variance, and the firet 
five factors account together for about 70% of the varian ce. 

The factors given in F ^ had the advantage of compression, but they were not 
very interpretable; that is, they could not be readily identified with single substantive 
dimensions. So a rotation was performed. An orthonormal matrix T^ has the property 
~ identity matrix; then a rotation of F^ can be written 



Frlf since (FfTr) (F^ Tf) F|.T^T|. F^. F^F^ = R, Such rotafions can pro- 
vide more interpretable factors, but some of the compression is always lost. But 
Interpretability was essential for the stratification, and the nonnal vorimax rotation 



procedure (Koiser, 1958) was applied to secure a . With the normal varimax 
procedure, the variance of the squared, row-normalized entries of F|,T is maxi- 
tnized, and Kaiser cairns that a) interpretable, simple-structure factors result, and 
b) those factors are relatively invaricnf under changes in the variable selection. 
Qualityf b ]is nice to contemplate, but impossible to test. QualityC a ]con be 
verified by examining the rotated factor matrix. 



The varimax transformation matrix Tj. is presented in Appendix G,1 . The 
rotated factor matrix F is given in Appendix G.2. Its rows correspond to the 
original variables, and its columns correspond to the rotated factors. The rows are 
bordered by row sums of squares, which are unifoimly 1 .0 since all the variance in 
each variable is accounted for by the factors. The columns are bordered by column 
sums of squares, which equal the variances accounted for by the factors; the facfors 
have been arranged in decreasing order of variance accounted for. Since the factors 
are uncorrelated, the variances ore additive. The first factor accounts for about '£% 
of the variance, and the first five factors account for about 50% of the variance . It 

can be seen that the compression has been relaxed, since for the unrotated factors the 
corresponding figures were 30% and 70%. 



The factor matrix (AppendiJ G.2) gives the correlations between the factors 
and the original variables. But its entries are not the weights needed to compute the 
factor scores from the original variables. (See Gloss, 1966.) The weights are the 
entries of the matrix QM"’t (Kaiser, 1962), which is presented in Appendix G.3. 

Act tally, the columns of the matrix have been nomralized for printout purposes, but 
proportionality within columns is correct. 


















Image Analysis. The input to the image analysis was the 31 by 632 input 
data matrix and its correlation matrix R (given in Appendix E.2). But conceptually^ 
the image analysis dealt with a transformed set of variables — the images of the 
original variables. In image analysis, each variable is conceptually partitioned 
into two parts: the image variable, which is the original variable as predicted by 
linear regression from the (>ther original variables; and the anti“image variable, which 
is the regression residual. Guttman (1953) has shown that in the limit, as a universe 
olF content becomes permeated with variables, image analysis approaches communal ity- 
type factor analysis. That is, the image and ,anti“image variables are approximations, 
within the original variable space, of the common and unique parts of the varinbles 
in a communality type analysis. To the extent that the approximation is accurate — and 
to the extent that the accuracy is associated with the intercorrelations of the anti-image 
variables— using factors derived from the image variables may lead to a more reasonable 
model for the data. Each variable has a certain portion of unexplained (unique or 
anti-image) variance; it is not necessary to consider each variable as being entirely 
contained in a common factor space. 

Another important property of image analysis is that the results are scale-free. 
With an analysis such as component analysis, the matrix factored is the correlation 
matrix, which is the covariance matrix for variables with variances of exactly 1 .0. 

If any rescaled version of the correlation matrix, corresponding to variables with 
different varionces, is factored, different results are obtained . But Guttman (1960) 
noticed that image analysis produces the same results no matter what scaling is applied 
to the image covariance matrix. As Kaiser (1963) states: "We are freed from the 
traditional agnostic confession of ignorance implied by standardizing the [ variables] . 
Here, standardization is merely a convenience to which we are in no way tied." 

The rotation procedure employed later (normal varimax) preserves this scale-fre.e property 
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Because of certain matrix identities, it was not necessary to construct the 
31 by 632 matrix of image variables in order to pertbm, the image analysis. Rather, 
the computation could proceed directly from the correlation matrix R of the original 
variables. (See Kaiser, 1963.) First the inverse R"’ of R was obtained. The recipro- 
cals of the diagonal entries of R"’ were considered to be the diagonal entries of a 
ytogonal matrix S^; fhese were the variances of the anti-image variables and ore 
presented in Appendix H.l . Each of these entries is the proportion of the variance 
in the corresponding original variable which was not predictable from the other 
original variables. Same variables in the present study (far example, 16 to 20) were 
almost entirely predictable; othem varied from about 11% to about 67% unpredictable. 
The precise tmnsfbnnation from original to image variables is given in Appendix H. 2. 
The matrix W = 1 - S^R"’ which appears there has rows corresponding to original 
variables and columns corresponding to image varitbles. Each column is the reg 
equation for predicting the corresponding original variable from the other thirty van 
ables; that is, for calculating the image variable from the original variables. For 
printout purposes, the matrix has been column-normalized, but proportionality within 
columns is correct. The image variables do not eqacl the original variables; they are 
non-singular linear tnsnsformations of the original variables, and the varU.nce of an 
i„oge variable is that part of th- variance of the corresponding original variable 

which is predictable from the other original variables. 

As a test of the fit to pure factor analysis, the correlation matrix of the anti- 
image variables was produced. It appears as Appendix H.3, and it was computed 
as SR"’ S. The unique parts of pure factor analysis are assumed ta bit perfectly un- 
correlated; so to the extent that image analysis is an approximation of pure factor 
onalysls, the anti-image variable, dtould be uncorrelated . That is, the matrix in 
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Appendix H.2 should be essentially diagonal. It is diagonal to a certain extent; that 
is, most of the entries are rather small"less than, say, 0,2. But there are some 
rather large entries, on the order of 0.9. This is somewhat disconcerting, but then 
again, the pairs of variables indicated by very large entries are very highly correlated, 
and they load on the same factor. 

The covariance matrix G of the image variables was the matrix factored, and 
the factorization scheme employed was that of Harris (1962). First the matrix RS"^ 
was formed, and its latent roots and vectors were determined. The latent roots are pre- 
sented in Appendix 1.1 and are considered to be the diagonal entries of the diagonal 

2 

matrix ; the corresponding latent vectors are presented in the columns of Appendix 
1.2 and are considered to form the columns of the matrix X. By the definition of latent 

roots and vectors, S “ ^ RS " ^ XB ^ . Harris (1962) showed tha , S " ^ GS “ ^ = XB ^X ' , 

2 2 2 '-2 ^ 
where B^ “0 .So the Harris factor matrix for G is = ^XB^ and 

Fq Fg “ (SXB ) (SXB ) =SXB ^X'S=SS ^ GS = G. F appears as Appendix 

1.3, where it is bordered by row and column sums of squares. The row sums of squares are 

the image variable variances. The column sums of squares are the amounts of image variable 

variance accounted for by the factors, and these amounts are additive, since the factors 

are uncorrelated. Each set of sums of squares sums to 22.4 which is about 72% of the 

31 units of variance that were in the original variables.' 

Note that the Harris factors of G were obtained rather than, say, the principal 
factors based directly on the latent roots and vectors of G. Certain features of the 
Harris factorization provide insight into the data. Note that the first- 19 roots of G, 
which correspond to Harris roots of R greater than 1 .0, decrease monotonically; and 
the last 12 roots of G, which correspond to the Harris roots of which are less than 1 *0, 
increase monotonically. Harris (1962) showed that; 



Cci ] if image analysis is regarded as a first approximation to canonical 
Factor analysis, then the first set of factors correspond to the real 
canonical correlations; 

t b ] the number of factors in the first set is the strongest lower bound, 

^ devised by Guttman (1954), on the number of common factors in the 

set of variables; 

[c 3 small roots in the second set imply high off-diagonal entries in the anti- 
image correlation matrix; that is, poor approximation to communality- 
type factor analysis. 

The ist point is implicit in Harris' (1962) formulation, and is made explicit in Kaiser 
(1963), From a formula given by Kaiser (^963, page 164) it wns calctl ated that the 
second set of factors accounts for about 68% of the sum of squares of Hie off-diagonal 
entries of the anti-image correlation matrix. 



Harris (1962) suggests that incomplete image analysis, employing only the first 
set of factors, is desirable. That is, he suggests that the first set of factors should form 
the basis for the entire image-analytic model of a given set of data, on the ^rounds that 
they are the part of the original variables which maximize the approximation to com- 
munal ity-type factor analysis. For three reasons, however, his advi'ce was not followed.. 
First, the omission of the second set would have left still about 32% of the sum of 
squares in the off-diagonal regions of the anti-image correlation matrix. Significant 
entries would have remained there. Second, most of the Harris -Guttman reasoning 



c 

assumes an application to the kinds of verlQbles obtainable from psychological testing, 
for which approximation to a universe of content is conceptually reasonable. It was 
not clear that the demographic and enumeration variables of this study could be said to 
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approximate a universe of content; they were selected according to availability 
rather than according to conscious permeation of a theoretical universe. Jhey were 
skewed rather than normal . Third, for purposes of comparison, it was considered use- 
ful to maintain compatibility with the component analysis, in which ail factors were 
retained. Consequently, all of the image factors were used, and the rotation destroyed 
Vie properties of the Harris factorization; any complete factorizotion would have led 
to the same rotated fortor matrix. 



in order to obtain a substantively interpretab ie basis for the factor space, the 



1 



normal varirnax rotation procedure was applied. The transformation matrix T ob- 

9 



tained is presented in Appendix J.l, and the rotated image factor matrix F T is 

w w 



presented in Appendix J.2. The rows and columns of F T are bordered by sums of 

9 9 



squares. The row sums of squares are again the variances of the image variables. The 
column sums of squares are the variances accounted for by the factors and have been 
arranged in decreasing order. Since the factors are uncorrelatod, these variances are 
additive. The first factor accounts for about 36% of the image variance, and the first 
five factors together account for about 87% of the image variance. This is not the 
most compressed basis possible for the factor space, but it will be shown that the fac- 
tors obtained in this study are quite interpretable. Because the factor scores were 



to be computed, the factor weight matrix was calcu lated. It appears as Appendix J.3 

-1 



and was computed as SXB T ; the columns correspond .to image factors and give the 

. r g 

linear functions for calculating the factor scores from the original variables. For print- 
out purposes, the columns have been normalized, but the proportionality within each 
column is correct. ' 



The application parallels that made to the components anolysis, q*v. 
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Comparison of Component and Image Models . A rotated component analysis 

and a rotated image analysis were performed. For the purpose of stratificotion, one 

of the analyses needed to be selected. The theoretic qualities of the analyses were 

to be considered in making the choice, but it was also necessary to consider the sub~ 

itantive interpretability of the factors. Ultimately, the strata were to be used for 

answering substantive questions. The factor matrices were used for interpretation 

and comparison. Appendix G.2 is the factor matrix for the component. Appendix J.2 

for the image models. The first five factors of each are abstracted in Table 9, where 

the name and loading of each high-loading variable on each factor is given. The 

image factors were finally selected and named, and in Table 9- the names assigned 

to the image factors are given. Also, the correlations between the factor scores of 

the two analyses were computed, as T ^ M Q RS "^XB T / and aopears as 

r r g ' 

Appendix K . 1; the rows correspond to the rotated component factors and the columns 
to the rotated image factors. 



Theoretical considerations led to the choice of image factors for stratification. 
First, the image factors were scale-free, and no assumptions had to be made concerning 
the relative metric properties of the original variables. Second, the image factor 
model is an approximation to the more reasonable communality models, for which it 

is not necessary to assume that each variable is entirely contained in a common-factor 
space . 



A theoretical advantage of the component analysis would have been parsimony/ 

< 

thCi iiiie; uriiuiuieu CumpCncnf GilCii’/SiS GXiTCICia ihc ffiOSt VOrloilCB fll the fswest 

factors. For the present data, this is indeed true if we consider the unrotated com- 

* 

ponent factors and the unrotated image factors; the first five component factore account 
for more variance than the first five image factors. But for the present dafa> this advantage 



TABLE 9 

A Comparison of Rotated Factor Structures 
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is lost when the factors are rotated. The first five rotated component factors account 
for about 51% of the variance of the original variables. The first five rotated image 
factors account for about 87% of the variance of the image variables, and for at 
least 60% of the variance of the original variables. So, in terms of rotated or 
meaningful structure, the image factors are more compressed and parsimonious. 

The greater paVsimony of the image factors leads us to the most important 
reason for choosing them as stratifying dimensions: they are more interpretable. It 
can be seen from the crosscorrelations of the two sets of factors (Appendix K.l) that the 
first and second rotated component factors are. essentially equal to the first and second 
rotated image factors. The first and second image factors have been named "Numerical 
Size" and "Organizational Complexity". In the image analysis, most of the informa- 
tion of the input data matrix variables has been compressed into the first five factors. 

Factors One to Five are truly clusters of variables. And, as will be seen in Part III, the 
factors are reasonable and meaningful . On the other hand, rotatfcd component factors 

Three to Sixteen are all of essentially the same magnitude in accounting for variance. That 
is, beyond the first two factors in the component analysis, no real clustering of the 
variables has occurred. That is, most of the factors, specific factors, each with just 
one high-loading variable. The first two factors woujd be useable for stratification, 
but the others are not substantially different from the input variables. 
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PART II 
SECTION D 

DATA PROCESSING! TECHNIQUES OF COMPUTING SCORES AND CODING DISTRICTS 

The first three sections of Port II discussed conceptual and empirical problems of 
the input variables, of'misslng data, and of factorization. This Section D presents the 
procedures and problems of using the computer to manipulate input data to provide sub- 
stitutions for missing data, and to derive the composite variables. Furthermore, it is 
demonstrated how the computer was programmed to assign codes for Pie composite vari- 
ables to loc-l education agencies, so thot the district population could be stratified. 

/ 

data reduction programs 



I 

i 

I 

I 

I 

i 

I 

A 

1 

1 




Monipulating Input Palo . The original sources of data were the "School/ 
District Tape", the "Employee Tope", and the "Valuation Deck". These sources were 
described in Section II. A. The first computer program called, program SSVAR, had as 



input these three files. The essential function of the program was ta merge me informa- 
tion from the three files, and to produce as output a new tape called the "Variable 

Tape". 

For each district, the program assembled the following records: each district's 
record, from the "School/District Tape"; all of the school records for each district, 
also from the "School/District Tape"; all of the employee records for each district. 




from the "Employee Tape"; the equalized property valuation for each district, from the 
"Valuation Deck" . The program then checked whether each district had any elementary 
enrollment. If the district did have elementary students enrolled, the construction of 
the variables pioceeried. Districts with no elementary enrollment were omitted from 
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furrtier consideration. Construction of the variables involved crossreferencing the 
collected records, extracting codes and recoding, counting and summing, and forming | 

means and variances. As the program proceeded from one district to the next, alJ of j 

the data were accumulated in a large table in the computer stoitige. Special codes I 

were stored in this tcble when missing data were detected for a district. Finally, | 

wtren the last district record was encountered, the program went into its second phase. I 

Subs^i^uting for Missing Data . The second phase of program SSVAR was concerned I 
with testing and replacing the missing data. The operations and results have been dis- 1 

' K 

cussed in Section II. B. In this phase, program SSVAR went through four steps. First, 

'i 

it performed three correlational analyses and produced as output ihe matrices appearing I 
in Appendices A to C. These analyses provided tests for the characteristics of the 
missing data. But the prograrn was set up to continue with the replacement operation. | 

In the second step, the regression coefficients were computed; they are presented in 
Appendix D. Third, the regression estimates for the missing data were computed and sub- I 
stituted into the internal table. In the final stage, a correlational analysis was performed 
which was based on the matrix with missing data replaced by regression estimates; the 1 

results of this analysis appear in Appendix E. 




The program SSVAR punched the final correlation matrix on cards, together with 
the final means and standard deviations of the variables in the input data motrix. The in- 
put data matrix was written on magnetic tape in a binary format. Fourty-four numbers 
were written in a record for each district. The first of these numbers was the WSDPI 
district identification code; the next 3] numbers were the district's values for the 31 
variables of the input data matrix; the final 12 numbers in the record were repeats of 
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iJie first 12 variables, but with special codes for missing data instead of regression 
estimates. Thus it could be determined, from a tope printout, which values were 

regression estimates and which were actual data. The result was the "Variable 

Tape". A listing of its contents was made for purposes of checking and reference. 

FoctorAnolyse| . After the output from program SSVAR wos checked and 
decisions were made about the results, the next program, called program SSFAC, 
was run. This program had as input the "Variable Tape" and the punched means, 
standard deviations and correlations produced by program SSVAR. Program SSFAC 
performed the components analysis, the image analysis, and the cross-correlotionai 
analysis. Then the program computed the rotated components and rotated image 
factor scores, and added them to the "Variable Tape", The modified "Variable Tape" 
then contained 106 numbers in a record for each district: these included the original 
44 numbers, plus 31 rotated cpmponents factor scores and 31 rotated image factor 
scores. Finally, the program produced a listing of all the factor scores, to corres- 
pond to the listing of the original variables, for purposes of checking and reference. 

Ijmitations of the Programs. The above description of the programs SSVAR and 
SSFAC is oversimplified. Although the programs did indeed perform the operations 
indicated, they were complicated programs, and involved a variety of difficult problems. 
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First, there were problems of tape compatibility. The tapes obtained from the 
WSDPI had been prepared on IBM computers, and the processing described here took 
place on the University of Wisconsin CDC computers. Although the IBM and CDC 
tape reading/writing schemes are theoretically compatible, there ere subtle differences 
in intensity and al ignment of the magnetic recordings. Tapes written on IBM equipment 
are difficult to read on CDC equipment. The initial runs of program SSVAR were on- 
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successful because of tape reading error^s. Seven, duplicate tapes from the WSDPI 
were tried. Finally, the University's auxilary IBM 1460 computer was programmed 
to copy the WSDPI tapes. The 1460 had been adjusted by IBM engineers to have 
recording characteristics which are a compromise of the IBM and CDC p Juliarities, 

The copies produced by the 1460 could then be read by the CDC computers. 

\ - 

Second, there were coding problems within the WSDPI tapes. For some yet 

unknown reason, one particular school did not have a school record on the "School/ 
District Tape". This caused the merging operation of program SSVAR to halt. A 
special patch was placed in the program to skip, the employees of that school. 

Third, the entire operation was performed originally for all districts in the 
state. But upon examination of the results, it was apparent that the inclusion of dis- 
tricts with no elementary schools was warping the factor structure and stratification. 

So the programs were rerun with the additional instructions to omit districts which in- 
cluded only high schools. 

The point of the above notes 1$ that the programs wrere specially written for the 
data received from the WSDPI. They ere special programs for operating on special data. 
Program SSVAR accepts data only in the particular format of WSDPI tapes. And pragram 
SSFAC operates only on the 31 variables defined for this study, and only with the 
particular inputs from the first program. The programs are not useful or useable for any 
other study. The only case where these programs could be used without modification 
would be a replication study, using WSDPI information for another year, say, 1967. 

Even then, the programs would need slight adfustments to meet the particular peculiarities 
and problems of the 1967 data. The contributions of this project are the production of 
the results for the particular data, and the development of a general methodology. 



The Data Bank . The hard result of the computer processing is the "Variable 
Tape". This tape contains the original 31 variables; the codes for real data and for 
data which are regression estimaires; the rotated components factor scores; and the 
rotated image factor scores The tape is written in such a form (floating-point binary) 
that it can be used only on CDC computers, but CDC computers could be used to trans- 
fer the data onto cards or onto IBM-compotible tape. For analyzing the outside vari- 
ables described in Section III.C, for example, the first five rotated image factor scores 
were punched on cards for input to a standard statistical program. The stratification 
program described below, on the other hand, operates directly on the "Variable Tape". 
The factor scores give composite measures of the qualities of the districts. The original 
variables give direct indices df the qualities, such as enrollment, of the districts. In 
summary, the "Variable Tape" is a bank of data on the elementary school districts of 
Wisconsin. 

STRATIFICATION 

The data bank provided the desired empirical basis for classifyirtg districts. The 
object of classifying districts was to form relatively homogeneous sub groups of districts. 
The composite variables were used, then, to build a multivariate stratificati )n of the 
population of Wisconsin elementary school districts. There are several possible ways 
in which the data bank could be used to construct a multivariate stratification. These 
possibilities are most distinctly differentiated by the choice of points for segmenting 
distributions of factor scores. For purposes of this project and the related illustrations, 

each of a few selected factor score distributions was simply dichotomized. 

* . , 

1 

The stratification, then, dees net utilize the composite variables in their full, 

continuous generality. Rather, strata ore defined by dichotomizing selected factor 
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scores. This sp? string of distributions is necessar/ for purposes of classifications* for 

example, it might be used for dividing the districts into groups of high or low 

Organizational Complexity. It should be noted that classification is a special case 

of measurement: it is measurement with variables which have only two values, zero 

and one. The dichotomized scores contain less information, or fewer distinctions 

between districts, than the parent distributions of the scores. 

S 



A special computer program, identified as SSRAT , was prepared to perform 
stratifications. There were three sources of input to SSRAT: they were the "Variable 
Tape", a deck of cards specially prepared from the "School /District Tape", and 
certain control information. The tape contained the image and components factor 
scores, as well as the original variables. The cards contained the actual name of each 
school district, as well as its Kind, Class, and Scope codes. The control information 

included a code to indicate whether the rotated image or the rotated components factors 

* 

were to be used, codes to indicate which factors were to be used, and codes for each 
factor to indicate whether that factor was to be dichotomized at it* mean or at Its 
median. The program first read the control information and the deck of name cards. 
Then it read the "Variable Tape", and stored the selected factor scores; and firvally the 
scores were dichotomized. 



The SSRAT program then produced its first important output, which was a 
complete list of the districts ordered by their WSDPI code numbers. This output con- 
tained four types of information for each of the 632 districts: the WSDPI code; the dis- 
trict values for Kind, Scope, and Class; the district scores on the selected stratifying 

% 

factors; and a code that indicated which classification group the district had been 

‘ ■ 1 

aisignea to* 
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A code fora classification group is called a stratum pattern^ and is in the 
form of a series of pluses and minuses. A plus indicates that a district had a score 
on the associated factor which was greater than the dichotomization point (mean or 
median of the distribution); if a district had a score below the dichotomization point, 
it was assigned a minus for Shat position in the pattern. For instance, if there are 
o^ly two factors used i(i a stratification, then all the possible stratum patterns are t 
+ + , and - -. Each district would have one and only one of these 

patterns, because it would be in one and only one classification group. 

In the case of using only two stratifyingt variables, the output ^ might look 
like this: 



ID 


Name 


Kind 


Scope 


Class 


Pattern 


Factor Scores 


0056 


Wellington 


3 


4 


2 


+ - 


2.34 - 1.67 


0063 


Dunedin 


* 2 


1 


1 


- 


-0.47 -1.22 


0070 


Timaru 


9 

3 


1 


4 


+ - 


1.46 -0.05 


0168 


Christchurch 


4 


2 


3 


+ + 


3.11 2.24 



The first column is the district identification code, and the listing is ordered by that 
code. The other columns are, from left to right: the name of tfie district; the Kind, 
Scope, and Class of the district; the stratum pattern; and the factor scores. In Hie 
actual stratification, five factors were used, instead of two. 

The second important output of SSRAT contained a section for each stratum. 

In these sections were lists of the districts in the strata, and summaries of the characteris- 
tics of the strata. The program obtained the list of the districts in each stratum by 
sorting the information in the first output according to the stratum pattern. The summary 
of chaiacteristics was then obtained after rereading the "Variable Tape" and retrieving 
and accumulating the original variables separately for each stratum. For example, th6 



^ This is a fabricated example. 

ERIC ; 



I'olal eiemenfary enrollment' in the stratum is determined as the sum of Variable 
1 over all districts with that stratum pattern. The format of this summary is given 
in Table 10; the data given in this table are the same summaries taken over all the 
districts in the state. Selected aspects of Table 10 are presented for all 32 stratum 
summaries in Appendix L, which permits direct cross-strata comparisons of summary 

I \ \ 

characteristics. 

The program SSRAT also produced two less important outputs. The first of these 
was a table of counts, which shows how many districts were assigned plus a nd how many 
were assigned minus on each factor. The second output was a list of districts ordered by 
stratification pattern codes. This list contained the same information as the first im- 
portant output, described above: district identification code, name. Kind, Oope, Class, 
and Stratum pattern. The formats of the two lists were identical; the only difference 
between them was the method for ordering districts. 



1 



District Summary Characteristics as Printed by Program SSVAR 
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PART III: SOME APPLICATIONS OF THE ALGORITHM OUTPUT 



Section A - 
Section B ** 
Section C - 
Section D -> 



illustration: Characterization and Description 
Illustration: Categorization and Comparison 
Illustration: Logical and Statistical Analysis 
Illustration: Sampling 
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part III 



SECTION A 

I LLUSTRATI ON: CHARAOERI ZATI ON AND DESCRI PTI ON 



Dis cossion in this s ection will center on report ing the exte. t to which the 
olgoririrn, provides efficient description of the dimensions olong which school district, 
voV. The known, row Indicators were the or iginol 31 voriobles defined ond discussed 
■n Section 1 1 .A. The analytic procedures discussed in Section 1 1 .C derived the five 
factors ' from the original 31 variables; these factors were weighted sums or composites, 
and accounted for at least 60% of the variance of the input data . That is, the five factor, 
compressed the input information and therefore were summary dimensions of school dis- 
frict variation. These dimensions also had the special quality of being uncorreloted. 



As summary dimensions, the factors are of intrinsic interest in understanding and 
describing the variation among school district,. So it is necessary to reach an understand- 
ing of the factors as quantitative measure, on the district,. The discussion in this section 
IS broken into two part,. First, the content, of the factors will be explored by examining 
the relationships between the facton and the original variable,. Second, the distribution, 

of the factors will be investigated for the purpose of understanding the empirical repns- 
sentation of their contents. 



FACTOR CONTENTS 



The compositions of ttae factor, are defined by the columns of the rotated image 
factor matrix. Appendix J.2, which is abstracted in Table 9. Description, of the factor 
content, are given In the following poragraphs. Special attention wo, ^iven to the variable. 



CoraLtent with Section II . C, only the flr,t five rotated image factar, are treated here. 



I 



$ 






.1 









■ ' j- 



■’i 















which had high loadings on a factor, and consideration was made of the kinds of dis- 
tricts which had simultaneously high values for those variables. Labels were assigned 
to the factors: the labels are for reference both to the operationally defined factoni and 
to the undefined constructs which they represent. In choosing the labels, considerations 
had to be given to the fact that the factors were uncorrelated. The factors and their 
labels are the results of a particular set of data and a particular analysis. For example, 
if ten more variables had been available, then the present factors might have been aug- 
merited or their structure might have been altered. 



F actor One, j ^^umerical Size. This factor was the largest one in the sense that 
if accounted for more variance than any other factor. The name was chosen because dis- 
tricts with high scores on this factor tend strongly to have large numbers of students and 
teachers at all grade levels; they also have relatively high numbers of large schools and 
are supported financially by a large total equalized property valuation. 

It would have been possible to employ statistical techniques to partial the in- 
fluence of size out of the input variables before computing the factor analysis; then no 
general size factor would have appeared. One of the objectives of this investigation, 
however, was to produce results which would be useful for a variety of purposes, and 
because size is often an important variable in studies of LEAs, no attempt was made to 

eliminate its effects. The orthogonal factorization ensured that this size factor is un- 
correlated with the other image factors. 



Factor Two, Organizational Complexity. A district with a high score on this 
factor tends to have relatively high numbers of students and teachers pe^ school. It also 
tends to have a greater degree of economic autonomy than districts with lower scores: 
tmtchers in such o district receive relotively high saiarieS, schools in the district have 



o 
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large equalized property valuaHon, and fhe district receives a high level of state aid. 

The teachers in such a district are quite variable with respect to salary and experience, 
but on the average they have more advanced academic degrees and teaching credentials 
than their colleagues from districts with a lower OrganizatioiiM* Complexity score. An 
elementary teacher from the district with the higher score on this factor is likely to 
specialize in teaching at a single grade level, but in the district there will be several 
teachers who have non-teaching duties, such cs counseling, or who teach only part time. 
Finally, districts with high scores on this factor are more likely to include a high school 
than their less complex counterparts. The relationship between the presence of a high 
school and score on Organizational Complexity ts discussed further in the notes on factor 
score distributions. 

Factor Three, Teacher Experience . The six variables with significant loadings 
on this factor are all measures of the characteristics of the teachers within districts. 

Districts with high scores on this factor have teachers who have high degrees and credentials, 
and whose averages for both local and total experience are high. There is also a relatively 
large variance among the teachers in the district with respect to the length of time they 
have taughtyboth locally and in totr'!. 

Factor ?our. School Unit Size. A district with a high score on the School Unit 
Size facti>r has a relatively high number of one-room and two-room schools; it has a 
small total equalized property valuation per school; it probably has a high school; and 
it is likely that the teachers in its schools are teaching two or more grade levels. 

Factor Five, Economic Power . Districts with high scores on.>this factor have a 
high ratio of property valuation per student, and are likely to be elementary-only districts. 
Such districts are considered economically powerful in supporting their existing school 

« 
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FACTOR SCORE DISTRIBUTIONS 



Each computed factor score hud a mean of exactly 0 and a variance of exactly 
1; that is, factor score distributions were standardized. For the purpose of illustrating 
the distributions, each is expressed here as a frequency polygon. The abcissa is identical 
for all the distributions and has been coded by mapping the range of sltindord scores into 
26^intervals. in Table is presented the coding scheme and the real limits of the inter- 
vals; the table also includes the exact interval frequencies for each factor, and the bottom 
line gives the distribution medians which were used in the stratification. The frequency 
polygons for the five factor scores are given in Figures 1, 2, 4, 5, and 6. Figure 3 is 
tbe distribution of district scores on Factor Two, Organizational Gxnplexity, distinguishing 
between those districts which include secondary schools as well as elementary schools and 
those which include just elementary schools. Three peculiarities of the distributions are 
discussed in the following paragraphs. 



Leptokurtosis of Factor One. The distribution of district scores on Factor One, 
Numerical Size, is the most non-normal of the five distributions. More than 78% of the 
entire district population resides in the interval coded number 9. If the distribution were 
normal, 9.9% of the population would be contained in this interval. But the leptokurtosis 
of the distribution does not imply that there are no reliable differences in size among 
three-quarters of Wisconsin's elementary school districts. The pile-up at the median is 
an artifact of the inclusion in the population of two or three large school systems. The 
lai^est district in the state is a metropolitan system which had a Numerical Size score of 
23.03. The probability of so large a score occuring in a normal distribution is one in 
several billion. This one district accounts for 84% of the variance of the factor. 
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1 

2 

3 

4 

.5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 



TABLE 11 



INTERVAL CODES AND EXACT FREQUENCIES 
OF FACTOR SCORE DISTRIBUTIONS 



Lower 

Limit 



-2.25 

- 2.00 

-1.75 

-1.50 

-1.25 

- 1.00 

“0.75 

-0.50 

-0.25 

0.00 

0.25 

0.50 

0.75 

1.00 

1.25 



1.50 

1.75 

2.00 

2.25 

2.50 

2.75 
3.00 

3.25 

3.50 

3.75 



4.00 



f Factor 
ributions 



Upper 

Limit 


1 


F R E 


Q U E N C 


1 E S 


II 


III 


IV 


V 


-2^01 


0 


2 


4 


2 


0 


-1.76 


0 


4 


9 


1 


2 


-1.51 


0 


27 


20 


5 


2 


-1.26 


0 


51 


30 


12 


9 


-1.01 


0 


36 


37 


36 


37 , 


-0.76 


0 


33 


47 


64 


65 


-0.51 


0 


39 


47 


73 


93 


-0.26 


37 


59 


65 


85 


79 


-0.01 


494 


61 


66 


107 


84 


0.24 


69 


84 


65 


82 


74 


0.49 


9 


69 


71 


38 


58 


0.74 


4 


46 


59 


26 


47 


0.99 


1 


27 


35 


28 


20 


1.24 


3 


26 


20 


19 


15 


1.49 


8 


22 


19 


12 


10 


1.74 


1 


11 


8 


12 


8 


1.99 


1 


13 


5 


5 


6 


2.24 


1 


4 


7 


3 


4 


2.49 


0 


5 


8 


4 


3 


2.74 


1 


6 


1 


4 


1 


2.99 


0 


3 


2 


2 


3 


3.24 


0 


2 


3 


2 


0 


3.49 


0 


1 


1 


1 


3 


3.74 


0 


1 


0 


0 


2 


3.99 


0 


0 


0 


2 


1 


23.03 


3 


0 


3 


7 


. 6 




-0.103 


0.008 


-0.024 


-0. 156 


-0. 156 



77 



It would have been possible to have excluded the large metropolitan districts 
from the input population or to have made normalizing transformations on their observations* 
Then these districts would not have been such radical outliers. But given die results as 
computed with the unaltered data, a researcher still has the choice of excluding the large 
districts. He may feel that the metropolitan school systems comprise a separate population 
and should be treated separately from all other districts; or he may believe that they are 
exVeme cases, but are,"tievertheless, members of the specified population. For example, 
in the stratification illustrated in Section lil.B, the outliers were not omitted, and con- 
sequentiy the Numerical Size scores in strata associated with "High" Numerical Size may 
vary from *-.119 to 23.03. But the discrimination between "High” and "Low" is adequate 
for the research purposes of that stratification. 



Bimodality of Factor Two . It can be seen in Figui 2 that the distribution of 
Factor Two, Organizational Complexity, has two frequency peaks— it is bimodal. In 
Figure 3, separate frequency polygons of Factor Two are displayed for the subpopulation 
of districts with high schools and the subpopulation of districts without high schools. 

These distributions have similar shape, but their peaks are located at different points. 

By comparing Figure 2 with Figure 3, it is apparent that the bimodality of Factor Two is 
a result of the mixture of the two differently centered subpopulations. This suggests, 
as is reasonable, that Organizational Complexity is manifested differently for districts 
with high schools than for districts without high schools. Stratifying the total population 
of districts nt the median of Organizational Complexity produces more a distinction be- 
tween those districts with high schools and those without than it does a distinction, within 
the subpopulations, between more or less complex organizations. 



Skewness . All the factors are somewhat skewed to the right. This is doubtless 

a consequence of skewness in the input data, for mcny of the input variables were 

enumerative. Demographic enumeration tallies tend to be skewed. All the distributions 

*■ 

should be examined carefully before cutting points for a stratification are chosen: 

. ■ '1 ' , 
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PART III 



SECTION B 

ILLUSTRATION: CATEGORIZATION AND COMPARISON 

The algorithm produced measures on the school districts which, os explained 

in the previous section, characterize the districts and which are considered important 

s, ^ 

dimensions of school district variation. The measures were the input for the stratification 
scheme presented in Section II.D. The stratification scheme produced a categorization 
of the district population which is useful in certain practical research processes such as 

stratified sampling— these applications are discussed in Section III.D. But also, the 
categorization provides c framework for substantive comparison of school districts and 
school district types. The discussion below is broken into two parts. First, further 
planation and interpretation is made of Hie categorization. Second, the summary charac- 

« t 

teristics of the categories or strata are presented, and notes are made concerning the kinds 
of substantive inference possible. 

CATEGORIZING THE DISTRICTS 

The dimensions or measures used for categorization were the first five rotated 
image factors which are discussed in Sections II.C and III .A. For each dimension the scores 
were points on a continuum, and no two districts had exactly the same score. In particular, 
no two districts had the same profile of scores. So while the multivariate analysis resulted 
in a clustering of the original 31 variables, it did not provide a direct clustering of the 632 
districts. To provide such a clustering of districts, a transformation wa» performed on the 
factor measures: each factor was coded plus for districts v/ith scores above: the median score 
and minus for districts with scores below the median score. Each district was then identified 
by a pattern or profile of five pluses or minuses. After this reduction in the amount of in- 
formation in the factor score speciflootfons, there were districts with the same foctor pattern. 



'1 



I n fdcf|, fhere are only 2^ = 32 possible pafferns, rnd so fhe population was partifioned 
info 32 cafegories. A sfrafum or cafegory of disfricfs is comprised of all the district in 
the population which have a unique pattern of five pluses and minuses. Note that the 

categories correspond to the cells In a full -factorial experimental design with five dichoto- 
mous factors. 



An objective in categorizing the districts was to put together those districts 
which were alike and to separate those which were different. However, the discussion in 
Section III. A on the distributions of the factors demonstrates that there remains considerable 



variability among, say, the 316 above-median districts for any one factor. By the time five 



such dichotomizations are superimposed on one another, the with in-stratum variability is re=^ 
uucea, Dur jr is sms to be wekrrOw! edged. The question of whether the wiiti in -stratum 
variability is greater than the between-stratum variability is an empirical one and could be 



investigated by analysis of variance procedures. But Interpretation of such an analysis would 
be difficult since the factors are no longer orthogonal after being dichotomized. This is 
demonstrated in an example in Section III.C. 



Within the algorithm it is possible to produce other stratifications. For example, 
a subset of the factors could be selected, or trichotomization could be used instead of 

dichotomization. Note that the stratification produces strata of approximately equal size be- 
cause me factors are uncorrelcied. 

COMPARING THE CATEGORIES 



The clustering and separating of districts according to the 2^ design provide a 
framework within which substantive comparisons can be made. Comparison at the level of 
individual districts is too detailed for general substantive purposes, but efficient comparison 
among strata can be based on summary characte; istics of the strata and the population. In 
Appendix L such summary characteristics are presented, and in Table 12 two pieces ex- 
tracted from the Appendix are displayed. The extract on the right of Table 12 corresponds 
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TABLE 12 

Stratum Characteristics: Extracts from Appendix L. 



+ 

+ 



+ 

+ 



17 



^.3 

94.1 

94.1 



2,331.1 



1,370.8 

960.3 



107.9 



51.0 

46.3 

10.6 



12.3 



5.9 

6.4 



59,670.6 



100 z 

107 .W 



21.6 

8.8 

25.6 

4,853.6 







Factor One 
Factor Two 
Factor Three 
Factor Four 
Factor Five 



Numerical Size 
Organizational Complexity 
Teacher Experience 
School Unit Si ze 
Economic Power 



Number of Districts 






Percent with county-based administration 
Percent which have high schools 
Percent which receive integrated aid 



Total Enrollment Per District 



Elementary enrollment per district 
Secondary enrollment per district 



Total Staff Per District 



Elementary teachers per district 
Secondary teachers per district 
Other professionals per district 



Number of Schools Per District 



Schools per district with only one or two rooms 
Schools per district wi th three or more rooms 



Equalized Valuation Per District 



S^udsnts per school in the stratum 
Students per staff in the stratum 
Staff per school in the stratum * 
Valuation per student in the stratum* 
Valuation per school in the stratum 



(Dollars x 1000) 



from fifth column of first poge 






. 



80 



STATEWIDE 

CHARACTERISTICS 

632 



79.7 

55.5 

61.3 



1,285.6 



798.0 

487.6 



58.4 



28.8 

23.9 

5c7 



4.3 



1.1 

3.0 



33,727.8 



2V5.9 

22.0 

13.4 

26.2 

7,762.6 



from fold-out tab on fourth page 
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to rtie fold-ouf fab on the fourth page of Appendix L. This contains the five factor 
identifications and the names of a series of twenty characteristics. To the right of these 
names appear the values of these characteristics for the entire state population. The 
thirty-two columns In the body of Appendix L contain the values of the characteristics 
as summarized for each of the ihirty-two strata. One of these columns, for stratum 
[jf+ appears In Tablf^ 12 and Is explained below. But the general form for a column 

Is this: at the top appears the pattern of pluses and minuses for the stratum; next Is the 
number of districts In the stratum and the proportions of districts with certain administrative 

features; then are given stratum averages related to enrollment, staff, school buildings, 
and valuation. 



1 9% * 
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4*l . 
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median for Numerical Size, Organizational Complexity, School Unit Size, and Economic 
Power; and all have factor scores below the median for Teacher Btperience. Thirty-five 
percent of the districts have county-based administration; ninety-four percent of them have 
high schools; ninety-four percent receive integrated aid. And, for example, there is an 
average of 51 elementary teachers per district in the stratum ; across the stratum, there are 21 
students per staff member; across the stratum, there is $25,600 valuation per student. 



Of particular interest in stTUtum&+ - +-0 is the large number of one and two- 



room schools. In fact, there are 101 such schools in the stratum. Yet the 



le aistricts in Hie 



ih-afum are predomlnafely city-based, have high schools, and have an abouf average amount 
of valuation per student. This suggests, as was verified by examining the districts In the 
stratum, that the districts are located In rural regions of the state but are centered In the 
business service centers of these regions. The one and two-room schools apparently are pre- 
sently maintolned in the rural outlying areas of the district. 



82 

There are 715 one and Iwo-room schools In Wisconsin, and this stratum— which 
is to say, this kind of district— accounts for almost 1/^tfi of them. Appendix L makes possible 
the substantive comparison of this stratum with other strata having large numbers of one and 
two “room schools. StratumQ- - ++ •^contains 59 one and two-room schools and its districts 
are mostly county-based, have low valuation per student, and centered in small villages. 
Stratum[j—++'2 includes 48 one and two-room schools, and its districts are similar to those 
of stratum JJ+ "• ++ / but they are smaller — that is, less of the rural area surrounding the 

villages is included. Thus by examining and comparing the strata and by making use of the 
multivariate descriptions, it can be seen Hiat there are several kinds of districts which con- 
tinue to have one and two-room country schools. ' 
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PART III 



SECTION C 

ILLUSTRATION: LOGICAL AND STATISTICAL ANALYSIS 

The algorithm produced measures on the districts which may be analyzed as 
sources of variation explaining the distributions of other variables across the district?. In 
par^cular^ the factor scones and functions of them may be used as independent variables 
in multiple regression on a dependent variable. When the scores are dichotomized, then 
the multiple regression is equivalent to analysis of variance. 



In this section, two illustrations of statistical analysis using the factor scores 
are given. In the first, the dependent variables concerned ESEA Title I fund allocation to 






• W II 



wwicwiivc wua lu ueieiiiiiiie nic uitterent.'c! prediction of the allocation 



variables from the district-characterizing measures. In the second, the dependent variable 
was viewpoint-productiveness of teachers in interviews leading to the construction of an 
item pool . The approach was to select teachers for interviewing according to an experi- 
mental design based on the dichotomized factor scores^and then to arxilyze their relative 



viewpoint-productiveness according to analysis of variance. Both illustrations are brief 
reports. In the first, the kinds of substantive inferences possible are emphasized; in the 
second, the major focus is on the kinds of experimental manipulations possible. • 



COMPENSATORY 



cru ir*ATi#’\Ki 



Problem. School districts in Wisconsin, like districts all across the country, 
are eligible to receive federal funds under Title I of Public Law 89-10, the Elementary 
and Secondary Education Act. The districts are to use their funds to support locally 



^ That is, the first five rotated image factor scores. 
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initiated programs of compensatory education for educationally disadvantaged children. 
The amount of money for which a district is eligible is the product of two factors: a 
dollar allocation rate and the number of disadvantaged children in the district. The 
allocation rate is based on county census figures and reflect, the proportion of county 
families with a basic annual income less than $2,000. 

' The objective- of the analysis was to determine the extent to which the dollar 

rate, the number of disadvantaged children, and the total Title I allocation are pre- 
dictable from the five district-characterizing factor, developed in this project. 



P rocedures . Values on the three dependent. Title I, variables were available 

one year later than the data inputs for the algorithm, and several consolidations of smaller 
districts had taken place. Also, there were a few district, which were not eligible for 
Title I funds because they had no disadvantaged children. Due to the disappearance and 
ineligibility of certain district,/ the population base for this analysis was a subset of 527 

of the 632 districts which were input to the algorithm. 

Two analyses were performed: in the first, the actual factor scores were used; 
in the second, the dichotomized scores determined for the stratification were used. The 
five district-characterizing factors, actual or dichotomized, were listed os independent 
variables and the three Title I indices were listed as dependent variables for input to a 
multiple regression analysis computer program. The program first computed the two-factor 
and three-factor interactions of the independent variables~that is, their two-way and 
three-way products. There were then a total of 25 possible independent variables: the 

five factors, the ten two-way interactions, and the ten three-way interactions. The 

analysis for the dichotomized score, i$ similar to a non-orthogonal analysis of variance. 



Resu I ts « The intercorrelaHons of the five factor scores and the three 

Title I variables are given in Table 13, The independent and dependent variables are 
separated by vertical and horizontal dashed lines. The intercorrelations from the analysis 
based on the actual factor scores are given in the upper right triangle, and the inter- 
correlations from the analysis* based on the dichotomized scores are given in the lower 
left triangle. Note that the actual factor scores are only slightly correlated; had the 
population size not been reduced, they would have been perfectly uncorrelated. 

In Table 14 are displayed the statistics describing the degree to which the 
dependent. Title I, variables are predictable from various sets of the independent, district- 
characterizing, variables. The upper half and lower half of the table present, respectively^ 
the results of tfie analysis using the actual and dichotomized factor scores. The left- 
middle, and right thirds of the table present, respectively, the results for the number of 
disadvantaged children, the allocation rate, and the total dollar allocation. Each of the 

9 

2x3 = 6 sections of the table is concerned, then, with either actual or dichotomized 

a 

factor scores and witfi one of the three dependent variables. 

In a section there are three columns of coefficients which describe the pre- 
dictability of the dependent variable from various subsets of the actual or dichotomized 
independent factors. These subsets are: each of the five factors separately, the five 
factors together, the five factors along with the two-way interactions, and the five factors 
along with the two-way and three-way interactions. The entries in the column headed R 

are multiple correlation coefficients of prediction. The entries in the column headed 

2 . . 

R are coefficients of determination — that is, proportions of the variance in the dependent 

4 

variable accounted for by tbe subsets of independent variables. The entries in rfie third 

2 2 
column are labelled A R and are tbe absolute differences between the R^ for the particular 

2 

subsets of independent variables and the R for the subset consisting of the five factors 

2 . 

together. These 6 R coefficients facilitate comparing the relative predictivenbss of the 
various subsets of independent variables, i 



TABLE 13 



Intercorre lotions of 
Stratifying Dimensions 
ond Title I Allocation Variables 





1 


2 


3 


4 


5 


1 " 
1 6 
1 


7 


8 


V 

Numerical Size 


— 


-010 


-003 


-009^ 


006 


1 964 


i37 


971 


Organizational Complexity 


062 


— 


-057 


-054 


019 


1 

007 


352 


-013 


Teacher Experience 


-185 


073 


— 


-015 


-018 


1 

1 068 


009 


054 


School Unit Size 


332 


030 


017 


— 


009 


1 

1 070 


-024 


038 


Economic Power 


-068 


-095 


-053 


-131 


— 


1 

1-055 


-048 


-038 


Eligible Children 


163 


070 


096 


022 


010 


1 

1 


128 


996 


Dollar Rate 


021 


200 


-013 


-070 


-071 


J 128 


— 


129 


Allocation 


.137 


043 


080 


-002 


024 


I 996 


129 





The intercorre lotions of the continuous variables are given in the upper right segment of 
the ^trix; intercorrelations of dichotomized variables are given in the lower left segment. 
Decimal points have been omitted. 



Table 15 gives the sfandardized regression coefficients from the analyses 
which involved multiple regressions. It is concerned only with the first two of the 
dependent yariables—the total allocation is not treated— and only with the analysis 
using the actual factor scores. For each dependent variable, three columns are given. 

In the first of these are the regression coefficients for the five stratifying dimensions, 
when only those dimensiops were used. In the second column are given the coefficients 
for the five dimensions and the two-way interactions, for the corresponding linear model; 
and the third column gives the coefficients for the regression model which used all 20 
interactions along with the five basic dimensions. Significant coefficients (p ^ .05) are 
indicated. Also given are the multiple correlation coefficient (R) and coefficient of 
determination (R ) associated with each of the regressions. 

Discussion . In the following five paragraphs, comments are made concerning 
features of the results. These comments are not intended exhaustively to explain Title I 



allocation, but rather to suggest the kinds of inferences that may be made with logical and 

sfatistica! applications of the algorithm* 

1 . It appears that the two dependent variables used in the allocation of Title 

I funds are related differentially to two stratifyinf^ dimensions. The number of disadvantaged 

children is best predicted by Factor One, Numerical Size; the dollar rate is best predicted 

by Factor Two, Organizational Complexity. 



2. The number of disadvantaged children is more predictable than the dollar 
rate. This is probably because the variation in the number' of children is largev than the 
variation in the allocation rate. Allocation rate varies by county rather than by district, 
and all the districts in a county were assigned the same rate. 

3. The results for total allocation are very similar to those for number of dis- 
advantaged children, just as the correlation between the total allocation and the number 
of disadvantaged children is h*gh. The high correlation is probably due to the difference 
in variability between the two factors which are multiplied together to determine the total 

t 

allocation. 



TABLE 15 



Standardized Regression Coefficients 
for the Relationship Between Stratification 
Dimensions and Title ! Allocation Variables 



• 


Number of Qualified Children 


Rate of Allocation 






Factors 


With 


With 


Factors 


With 


With 


Variable 


Only 


2-way 


3-way 


Only 


2-way 


3-way 


1. Numerical 


% * • 


* 


* 


* 


_ 




' Size 




.SO 


.85 


.14 


.04 


-.51 


2. OrganizationH 




* 


it 


i ^ 




* 


Complexity 


.03 


.06 


.07 


.36 


.31 


.32 


3. Teacher 


* 


it 


it 








Experience 


.07 


.10 


.11 


.03 


.02 


.08 


4. School Unit 


* 


* 


it 








Size 


.08 


.08 


.09 


.00 


.00 


.00 


5. Economic 




it 


it 








Power 


-.06 


-.06 


-.06 


-.06 


-.07 


-.10 




• 


* 


* 








1 X 2 


* 


-. 12 * 


-.34 




.06 


1.06 


1x3 




.14 


.21 




.09 


.45 


1x4 




.05* 


.12 




-.08 


-.27 


1 X 5 


♦ 


—.05* 


.05 




-.02 


-.11 


2x3 




.03 


.03 




-.07 


-.10 


2x4 




.01 


.02 




CM 

O 

• 

1 


.04 


2x5 


■ 


-.03 


-.01 




.11 


.14 


3x4 




.00 


.02 




.01 


.00 


3x5 




-.01 


-.03 




.01 


.07 


4x5 




-.01 


.01 




.07 


.12 


1x2x3 






.01* 






-.64 


1x2x4 


• 




-.27 






.44 


1x2x5 






. 01 * 






-.05 


1x3x4 






.08* 






-.09 


1x3x5 






rsry 

-.u/ 






.07 


1x4x5 






.10 






-.05 


2x3x4 






.00 






.06 


2x3x5 






-.01 


'• 




.04 


2x4x5 






-.02 


« 

s 




-.20 


3x4x5 






-.01 


% 

s 




-.07 


R . 


.97 


.98 


.99 


.38 


.40 


.42 




.94 


.97 


.97 


.15 


.16 


.18 



* The associated value is significant beyond p ^«05» 



4, When actual factor scores are used, the Inclusion of interactions 
adds little to the prediction. When dichotomized scores are used, the interoctions 
have considerable utility. 

5. There is a considerable loss of precision when dichotomized scores are 
used. Also, the dimensions are not orthogonal after dichotomization. The entries above 
theVliagonal in the uppef'left quadrant of Table 13 are not exactly zero because several 
districts which supplied inputs in the construction of factors were not included in this 
analysis. These entries, are, however, very close to zero. Certain intercorrelations of 
the dichotomized scores depart significantly from zero. 

TEACHER VIEWPOINTS 

Problem. Attitude and viewpoint item pools are often generated by collecting 

and collating information from the responses of several informants. In order to understand 

* 

the nature and composition of such pools it is necessary to ascertain the differential con- 
tributions of the sources providing information. This is exemplified in a study (Miller, 

£L* ELt' 1967) of which a major objective was to secure viewpoints of teachers concerning 
ways of facilitating classroom learning. The viewpoints were obtained by means of tape- 
recorded, open-ended interviews, and content analysis of the recordings led to the formation 
of an item pool . 

One dependent measure was viewpoint-productiveness, as measured by the 
number of statements which resulted from an interview. The analytic objective was to 

S'- 

determine how the number of discrete statements obtained from a teacher's interview protocol 

varied as a function of district factors, teacher factors, and interview factors. Altogether, 

seven factors were to be studied, and each of these factors had two levels. The first four 
» 

factors were the first four district-characterizing dimensions; factors five and six were two 
teacher variables, grade level taught and length of teaching experience; factor seven was 



the order in which the two sections of the interview schedule were administered. 



Procedures . in order to determ'ne the effecrs of the seven factors in the 
differential contributions of the teachers to the item pool, the selection of teachers to be 
interviewed and the administration of the schedule was directed according to an experi- 
mental design. The seven factors provided a basis for a 2^ design; there were seven factors 
eacl;i at two levels, defin.mg 128 cells or treatment combinations, if there were to be 
replicates for the purpose of estimating error, at least two teachers would have had to be 
interviewed within each of these cells. This would have necessitated conducting at least 
256 interviews, which was impractical given the limitations of time and cost. 



So in order to be able to estimate the effects of all seven factors without having 

7—3 

to conduct 256 depth interviews, it was decided to employ a 2 fractional factorial 
design^, which allowed reduction in the number of treatment combinations needed. Two 



teachers were interviewed in each of the 16 treatment combinations defined by a 2 



7-3 



design, and the total number of interviews called for was therefore reduced to 32. As is 



shown below, however, interpretations of the results of the consecjuent analysis are more 

7 

tentative than they would have been in the full, 2 , case. 



For the purpose of developing the fractional factorial design matrix, the levels 
of each factor were coded as + for "high" and - for "low". The seven factors for the study 



then were: 
FACTOR 



District 
Factors , 

Teacher 

Factors 

Interview 

Factor 




Numerical Size 
Organizational Complexity 
Teacher Experience 
School Unit Size 

Grade Level Taught 
Total Teaching ^perience 

Order of Interview Schedule 



CODED + 
Above Median 

II 

It 

It 

Intermediate 
10 years or more 

Order A-B 



CODED - 
Below Median 

II 

II 

II 

I 

Primary 

Ipss than 10 years 
Order B-A 



^ Fractional factorial designs are described in detail in Box and Hunter^ 1961." Their 
utility in educational research is discussed in McLean, 1966. 















In Ifie study of the relationships between the seven factors and a dependent variable, there 
are 128 sources of variation: the grand mean, 7 main effects, 21 two-factor interactions, 

35 three-factor interactions, 35 four-factor Interactions, 21 five-factor interactions, 7 six- 
factor interactions, and 1 seven-factor interaction. If there were to be only 16 treatment 
combinations, each would have to estimate the combined effect of eight of these 128 sources 
of variation. By using a fractional factorial design, the pattern of the confounding of the 
. sources of variation could be specified, within certain limits, in advance. 



This illustration is of the methodology for performing the selection and analyses. 
Therefore, the results presented in the following paragraphs am the parameters for the particu 
lar design constructed. The technical details of the construction are not given. 



Res^. The present design was structured so that the 1 x 4 interaction would 
not be confounded with any main effect. This was desirable because, as is noted in Section 
III.B, the dichotomized factor stores for factors One and Four were substantially correlated. 
Furthermore, no main effect was confounded with any Iwo-focfor interaction which was con- 
sidered possibly potent. The following are the confounding relationships for the main effects; 



I = 12345 = 1236 = 2347 = 456 = 157 = 1467 = 23567 

1 = 2345 = 236 = 12347 = 1456 = 57 = 467 = 123567 

2 = 1345 = 136 = 347 = 2456 = 1257 = 12467 = 3567 

3 = 1245 = 126 = 247 = 3456 = 1357 = 13467 = 2567 

4 = 1235 = 12346 = 237 = 56 = 1457 = 167 = 234567 



5 - 1234 - 12356 = 23457 = 46 = 17 = 14567 = 2367 

6 = 123456 = 123 = 23467 = 45 = 1567 = 147 = 2357 

7 = 123457 = 12367 = 234 = 4567 = 15 = 146 = 2356 

The first line in tfie list coniains the principal generators of the design. And, for example, 
the second fine indicates that the first main effect was confounded with fhe2x3x4x5 
interaction, the 2 x 3 x 6 interaction, etc. . 
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The actual design matrix^ac cor ding to which the teachers were selected to 
be interviewed.is given as Table 16. Each row of the table shows a treatment combination. 
For row 6, for example, it was necessary to select— at random— two districts which were 
above the median on the dimensions Numerical Size and Teacher Experience and below 
the median on the dimensions Organizational Complexity and School Unit Size. In each 



of these districts, it was necessary to identify all the teachers who had less than 10 years 
teaching experience and who taught in the intermediate grades. One of these teachers vras 



selected at random from each of the two districts, and both persons were interviewed in 
ihs same schedule order, segment A followed by segment B. 

After the interviews had been conducted and the content analyses completed, 
the statistical analysis could be computed. The structure of the ANOVA summary table 
is as follows: 



SOURCE SUMS 


OF SQUARES 


DF 


MEAN SQUARES F 


A. Numericol Size 


SSa 


1 


SSa /I 


MS^ /MS, 


B. Organizational 
Complexity 


SSb 


1 


SSg /I 


MSg/MS, 


C. Teacher 
Experience 


SSc 


1 


SSc /I 


MSc/MS, 


D. School Unit Size 


SSp 


1 


SSp/l 


MSp/MS, 


E. Grade Level 


ssg 


1 


SSp /I 


MSp/MS, 


F. Teaching Experience 


SSp 


1 


SSp /I 


MSp /MS, 


G. Schedule Order 


SSq 


1 


SS^/I 


mSq /ms 


H. All Other Controlled 
Sources 


SSh 


8 


SSh/B 


MSp, /MS, 

a 


1 . Error 


ss, 


16 


SS|/16 


A 


The analysis could be modified to test specifically the 


effect of the 1 X 4 fnteraction^alc 



with the effects confounded with it. 



• 1 ^ 









1 









$ 
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TABLE 16 
7-3 

Design Matrix for 2 Fractional Factorial 
Used in ttie Study of Teacher Viewpoint-Productiveness 



Run 

2 . 



1 2 3 4 5 6 7 

Numerical Organizational Teacher School (1234) (123) (234) 

Size Complexity Experience Unit Size Grade Personal Schedule 

level Experience Order 

+ + + + + + + 

+ + + - - + - 



3. + + 



+ 





4. 

5 . 

6 . 

7 . 

8 . 

9. 

« A 

lU, 

11 . 

12 . 

13. 

14. 

15. 

16. 





+ 



+ 

+ 



+ 

+ 

+ 

+ 



+ 

+ 

+ 



+ 



+ 

+ 



t 





& 



• n‘'j 




Discussion. The scheme described and parameterized above provides an 
efficient approach to logically analyzing the effect of the factors on the differential 
contributions of the teachers to the viewpoint item pool. The actual numerical results 
are not presented here, for the purpose of this illustration has been to demonstrate the 
utility of the outputs of the algorithm in designing statistical analyses. The scheme 
also insures variability of content, since according to the design, teachers are selected 
from different kinds of districts and with different teaching situations and experiences. 
This latter aspect of the design is discussed further in the next section. 
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SECTION D 

ILLUSTRATION: SAMPLING 

Th© mulfiyariai'e stratificat'ion defined within the algorithm provides the 
framework for many stratified sampling schemes. A distinction may be made among 
thr^e basic types of sam|i?ling objectives: sampling for analytic studies, sampling for 
enumeration studies, and sampling for maximum variance. 

Analytic studies are concerned with determining relationships among sub“ 
groups of a population and often involve analysis of variance. An example of an analytic 
sampling study was outlined in Section III.C. Enumeration sampling is performed to in- 
crease the precision with which the population can be described, and an example of such 
a study is given later In this section. Maximum variance samples are selected when con- 
cern Is for maximizing the probability of permeating a content domain. For example, if 
the responses of selected subjects were transformed into items for a factor battery, and a 
certain type of person In the population was not represented in the selected subjects, then 
items special to that type of person would be missed. Maximum variance sampling is 
exemplified by the interview study of elementary teachers' viewpoints outlined in Section 
III.C. This is explained at the conclusion of this section. 

ISSUES IN SAMPLING DESIGN 

The basic concepts and techniques of sampling theory are essential in applying 
the algorithm for obtaining stratified random samples of school districts. Most references 
on the theory of stratified sampling deal with the properties of the estimates from a stratified 
sample and with the ways for choosing stratum sample sizes so as to obtain adequate precision 
Procedures for constructing strata are usually not discussed in these references. Convereely, 



this report deals with a particular approach for constructing strata. The technicalities 
of sampling theory and formulae are not presented in this report. But, of course, careful 
attention to sampling theory is crucial in ensuring validity of a sampling application of 
the algorithm. In order to provide perspective, brief mention will be made here of four 
generally important issues of sampling theory raised in Cochran's text. Sampling Techniques 
(1^53). Other rele jnt'issues may be found in Doming (1953), Showell (1957), and 
Moonan (1953). 

Stratification. Any research study involving random sampling requires the 
identification and coding of all the members of the population. Such specification immedi- 
ately makes feasible simple random sampling, and stratified tion may seem unnecessary. 

But stratified random sampling may yield increased precision either in the sense that a certain 
precision is maintained in every stratum of the population or in the sense that estimates of 
characteristics of the total population are more precise. Such increase in precision results 
whc.i the strata are relatively homogeneous with respect to a criterion variable. In such 
a case, it is advantageous to treat each stratum as a population in its own right, and this 
is accomplished through stratified random sampling. But the homogeneity of the strata 
with respect to a particular criterion variable is rarely known in advance, and whether 
stratification is appropriate must usually be determined by substantive and theoretical 
hypothesis. Estimates of the gain in precision due to stratification may, however, be made 
after criterion data are collected. 

A secondary reason for choosing stratified random sampling may be administra- 
tive and logistic convenience. For example, in educational studies, access to individual 
schools ordinarily must be obtained from district headquarters, so stratification of schools 
into districts is naturally imposed by this administrative circumstance. Logistic problems 
of sampling and of collecting data may differ in various subdivisions of the population. 



For example, it may be practical to obtain measures on all schools in small districts, 
while it may be necessary to further sub'-sample within large districts. Perhaps con- 
versely, it may be possible to obtain measures on all schools in a geographically com- 
pact urban area while it may be costly to reach all schools in a rural area where schools 
are farther apart. 

Sample Size . The main problem ii. determining sample size is to obtain maxi- 
mum precision with minimum cost — in terms of available resources. The function relating 
precision with sample size can usually be estimated; generally it involves unknown popu- 
lation parameters. When estimated it may be substituted into the function relating sample 
size and cost, and then^given the desired level of precision, the minimum sufficient sample 
size may be computed. If cost and precision functions markedly differ for subdivisions of 
the population, then the sample size needs to be separately determined for each subdivision. 
If several characteristics are to be studied in one sample, there may be conflicting sample 
size recfuirements. These conflicts need to be resolved by considering the costs incurred 
by oversampling and the relative priorities among the characteristics. 

Geography . If geographically adjacent units are more alike than units which ■ 
are far ap>art, then it may see n reasonable to use geographical boundaries as strata 
definitions. Although sampling from such strata may be fairly efficient, the strar./ying 
dimensions are probably not directly related to the specific objectives of a sampling study. 

A sufficiently complex stratification based on theoretically relevant dimensions will, 
however, result implicitly in geographical differentiation if, indeed, there are genuine 
differences between geographical regions. The stratified random samples drawn within 
the present algorithm have, for example, been scattered throughout the state of Wisconsin. 
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Sampling Unit . The units of analysis — that is, the units which ultimately 
are measured — in some studies are selected after a series of sampling stages involved 
different units of classification. In the study outlined In Section III.C, for example, 
teachers were sampled from a stratified random sample of districts. The district level 
was chosen for the present algorithm because schools, teachers, and students form a 
nested hierarchy under districts. The utility of sampling, for example, teachers from a 
stratified random sample of districts depends on whether the teachers nested in a stratum 
f districts are relatively homogeneous. When this stage sampling is used. It is possible 
to compute how much of the variance of a measure obtained on the teachers Is due to 
individual differences among teachers and how much is due to differences In teachers 
across dish'Icts and strata of districts. 

ENUMERATION SAMPLE: TEACHING VACANCIES 

« 

Problem. Late In the summer of 1966, the Wisconsin State Department of 
Public Instruction found thot It was necessary tc determine, or to estimate accurately, the 
number of teaching positions, at all grade levels, which had not been filled for the approach- 
ing academic year. It appeared to be impossible to canvas all the districts In the state 
in the time available, so it was decided to distribute a questionnaire to a ccrefully selected 
sample of districts, to focus the resources of the WSDP! on cbtaining complete and reliable 
information from that sample, ond to estimate the state's teaching vacancies from the 
sample data. 

Procedures . The first two factors,^ Numerical Size and Organizational Com- 
plexity, were selected as stratifying variables and were dichotomized dt their medians, so 
there were four strata of districts: ++, +-, -+^ and — . 



Actually these were the factors of an early, incomplete version of fhe algorithm. 




Results, All but two of the questionnaires were returned in time for 
analysis, and the projections were computed and disseminated within ten days from the 
initiation of the survey^ 

Discussion. There was no way to validate directly the precision of the findings 
bosed on Hi is sample, but it was possible to validate it in an indirect fashion. Although 
the teaching vacancies in districts were not available as o direct check on accuracy, the 
district enrollment figures were uniformly available. As a kind of check on the sampling 
accuracy, then, enrollment figures from the same sample of 50 districts was used to pro** 
ject the statewide K-12 public school enrollment in Wisconsin. The resulting projection 
was accurate within 1 .5% of the actual statewide public school enrollment. 

SAMPLING CONTENT 

Problem. When attitude and viewpoint item pools are constructed from collated 
information gaHiered from the responses of a number of informants, it is of major concern 
that procedures for collection and collation be devised so as to ensure permeation of the 
desired content domain. One rrKznipulable aspect of the collection process is the selection 
of respondents. This selection is critical if different informants make contributions to 
different ports of the content domain. 




The content domain of interest in the study (Miller, 1967) outlined in 

Section iil.C v^as elementary teachers' viewpoints on classroom learning. An underlying 
assumption of that study was that individual teachers differ in the areas of the content 
domain to which they might contribute. This was considered plausible because teachers 
vary with respect to their life experiences, par ionality characteristics, and teaching 
backgrounds. Furthermore, it was assumed that teachers from different kinds of school 

I 

districts would t^nd to contribute to different areas of the content domain. This was 
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considered plausible because particular kinds of districts attract particular kinds of 
teachers, and the kinds of educational conditions in a district influence the professional 
experience of a teacher in a district. 

Procedures . The selection of teachers for interviewing was based on the fractional 
factorial design of Section III.C. Within the design, teachers Were stratified according to 
teaching experience and background, and the Selection of districts from which teachers 
were sampled v/as based on the district-characterizing dimensions, which include teacher 
experience characterizing information. 

Discussion . A typical approach to selection is to examine those who are most access 
ibie— that is, those who teach in districts which cooperote with research projects. To 
demonstrate the inadequacy of this routine. Table 17 has been prepared to show the factor 
scores of ten school districts from which researchers frequently solicit cooperation. Clearly 
this set of schools is not a representative sample, especially with respect to the first two 
factors. Numerical Size and Organizational Complexity. If teachers from these often 
researched districts had been interviewed in the study of viewpoints on the facilitation of 
learning, and if differences in viewpoints were correlated with Numerical Size or Organi- 
zational Complexity, important regions of the content domain might have been missed. 

Demonstration of the power of the staged stratified random teacher selection pro- 
cess in permeating the content domain was not accomplished by Miller et al., (1967) 
due to the great difficulty in quantifying the qualities of the content areas. The selection 
process was based on substantive assumptions concerning teacher experience and district 
characteristics. The selection process was inter»ded to provide maximum variance in the 
sample leading to permeating the content domain; in that sense the sample was intended 
to be representative, for effort was made to ensure that as many differen| viewpoints as 
possible would be presented. 
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TABLE 17 

Factor Scores and Summary Statistics 
for a Sample of Often" Researched Districts 
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District 


Pattern 


Factor One 


Factor Two 


Factor Three 


Factor Four 


Factor Five 


A 


+-H-- + 


^,0.92 


1,32 


0.32 


0.63 


0,47 


B 


+++- + 


1.42 


2,93 


0,84 


-0,20 


2,97 


C 


++- -H- 


0.09 


1,87 


-0,68 


0,15 


0,80 


D 


+++-- 


1.39 


1,14 


0.16 


-1,36 


-1,34 


E 


++ — + 


5.55 


1,36 


-0,76 


-0,37 


0,35 


F 


+ -+- + 


23.03 


-2,04 


0,57 


-0,64 


-0,11 


G 


++— - 


1.28 


1.72 


-0,26 


-0.89 


”0.56 


H 


++ 


4.40 . 


0,52 


-0,47 


-2.30 


-1.78 


1 


++ 


0.50' 


2,15 


-1.38 


-0.88 


-0.84 


J 


- + 


-0.10 


1.26 


-0,87 


-0.22 


-0.85 




Highest Value 


23,03 


2,93 


0.84 


0,15 


2.97 




Lowest Value 


-0,10 


-2,04 


-1,38 


-2,30 


-1.78 


- 


Mean Value 


3,80 


1.22 


-0.25 


-0,73 


-0.09 
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APPENDIX A 

This appendix contain^ the results of a correlational analysis which was 
designed to indicate special characteristics of districts with missing data. The 
initial 31 by 632 data matrix was temporarily redefined by changing the recordings 
of the first 12 variables to special dummy variables. The dummy variables have 
been called "non-missingness" variables, since they were computed to be "1" for 
entries which were not missing (in the original data) and "0" for those that were. 



Based on the redefined data matrix, means, standard deviations, and cor- 

ft 

relations were computed. The means and standard deviations are presented in 
Appendix A. 1 . The correlations are presented in Appendix A. 2. 



Of special interest are the means of the first 12 dummy variables, since they 
are the proportions of non-missing data. Also the upper left 12 by 12 portion of the 
correlation matrix indicates the predominance of Type A missing data. And the upper 
right 12 by 19 portion indicates the special characteristics of the districts with no 
missing data. 



This appendix is explained and interpreted in Section II. B. 
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MEANS AND STANDARD DEVIATIONS 



MEANS 



r 

k 



STANDARD DEVIATIONS 



1 

2 

3 

4 

5 

6 
7 
fl 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 

29 

30 

31 



0.993671 
0.995253 
1. 000000 
1.000000 
1.000000 
1.000000 
0.852848 
0.856013 
0. 857595 
0.857595 
0.857595 
0. 857595 
1.797468 
1.444620 
1.746835 
487.558544 
797.996835 
28.«f27532 
6. 174051 
17. 7405 06 
2. 155063 
3.465190 
0.670886 
0.460443 
3.213608 
33727812.816406 
32552.498190 
199. 202534 
20. 849323 
9. 517363 
5380376.963135 



1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 

29 

30 

31 



0. 079304 

0.068734 ' 

0.000000 

0.000000 ^ 

0.000000 ! 

0.000000 J 

0.354257 
0.351077 

0. 349465 i 

0.349465 

0.349465 I 

0. 349465 

0.401886 ; 

0.496924 I 

0.954249 

2060. 797673 | 

3224. 220010 
101.939330 

36.860129 3 

55.195788 i 

3.2140 92 1 

20.754031 I 

1.660002 ’ 

0.932035 

7. 207828 ; 

167630550.390625 

28658. 983108 i 

158. 497818 j 

4.198261 
7.386481 

5501990.301147 | 
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APPENDIX B 



This appendix contains the results of a correlational analysis of a special 

kind. The computations were based on the initial data matrix, and the results 

* >• 

served as criteria for the adequacy of replacements for missing data. Because some 
of Hie data items in the initial data matrix were missing, standard correlational 
techniques could not be applied. Instead, for each coefficient (mean, standard 
deviation, or correlation), all non-missing data available for computing the coefficient 
were used. Thus different numbers of districts are involved in the calculations of the 
different coefficients. And the results use the maximum amount possible of the in- 
formation in the initial data matrix. 



Ar>r%onr4iv R 1 le « C-.-. 

^ 1 ^ v/i \«wwiiia wi uidit PMI CUUri pulf ur VQr)UOI6S/ 

the entry equals the number of districts for which neither of the variables was missing. 
Each diagonal entry equals the number of districts for which the corresponding variable 
was not missing. Appendix 2 presents the means and standard deviations of the 
variables in the initial data matrix; each mean and standard deviation is based on a 



variable's values in all districts for which that variable was not missing. Appendix 
Be 3 is the correlation matrix of the variables; each correlation coefficient is based on 

3 ■ ■ 

ail districts for which neither of the correlates w js rriissing. 



This appendix is discussed and interpreted in Section II. C. 
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Appendix B. 2. MEANS AND STANDARD DEVIATIONS 



MEANS 



STAN DARD DEVIATIONS 
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20 
21 
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25 

26 

27 

28 

29 

30 

31 



5.549394 
3.138771 
5012.482243 
47.416935 
113.412368 
1.632026 
1.941661 
-0.792777 
12.839001 
7.530149 
8.914971 
-1.673442 
1.797468 
1. 444620 
1.746835 
487.558544 
797.996835 
28.827532 
6.174051 
17. 740506 
2. 155063 
3.465190 
0.670886 
0.460443 
3.213608 
33727812.316406 
32552.498193 
199.202534 
20.8A9323 
9. 517363 
5380376.962402 
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1. 746399 
0. 706045 
839.014015 
30. 421660 
53. 5078 13 
2. 287443 
0. 767724 

0. 838533 

1. 427974 
1. 405401 
1. 11775 
1. 5025 
0. 4018 

0. 4969^ ^ 
0. 954249 
2060. 797673 
3224. 2200 10 
101.939330 
36. 860129 
55. 1957‘='8 
3.2140 2 
20. 7540 31 
1. 660002 
0. 932035 
7. 207828 
167630550. 402344 
28658. 983093 
158. 497818 
4. 198261 
7.386481 
5501990.301270 
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APPENDIX C 



This appendix contains the results of a correlational analysis which was 
perfocmed as a necessary prerequisite for the regression analysis, which in turn 
wqs necessary for replacing the missing data. This analysis was based on the initial 
data matrix. However, before input to the analysis, the initial data matrix was 
temporarily modified by omitting from it the entries for all the districts for which 
there were any missing data. This reduced the dimensions of the initial data matrix 
from 31 by 632 to 31 by 539. The reduced data matrix had no missing entries, so it 
could be analyzed according to ordinary correlational techniques. 



in Appendix C. 1 appear the .rreans and standard deviations of the 31 variables, 
based on the 539 districts with no missing data, in Appendix C. 2 appear the correla- 
tions of the varicb I es. > 



Because districts have been omitted, this correlation matrix differs from the 
maximum information correlation matrix. Appendix B. 3. But since all the coefficients 
in this matrix are based on the same districts, unlike those of the maximum information 
correlation matrix, regression analysis could proceed from them. 

This appendix is discussed and interpreted in Section II. B. 



Appendix C. 1 . MEANS AND STANDARD DEVIATIONS 



MEANS 


L 


5.816363 


2 


3.263983 


3 


5173.263965 


4 


51.238409 


5 


116.700587 


6 


0,. 748498 


7 


1.941661 


8 


-0. 784994 


9 


12.862589 


LO 


7.544315 


LL 


8.918223 


L2 


-1.659862 


L3 


1.762523 


L4 


1.348794 


15 


1-530612 


L6 


571.682746 


17 


932.012987 


L8 


33.623377 


19 


7.239332 


20 


20.801484 


21 


2.513915 


22 


4.063080 


23 


0.632653 


24 


0.521336 


25 


3. 768089 


26 


39371783.858398 


27 


28936.453205 


28 


229.901673 


29 


21.026479 


30 


10. 968411 


31 


6133231.986572 



STANDARD DEVIATIONS 


1 


1. 386845 


2 


0. 5828 53 


3 


772.474792 
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28. 063402 
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43. 862761 
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0.831027 
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0. 767724 


8 


0.830277 


9 


1. 370490 


10 


1. 383897 


11 


1. 111709 


12 


1., 798665 


13 


0.425537 


14 


0.476589 
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3. 350476 
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22. 419175 
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7.669907 


26 


130919462. 738281 


27 


22 163. 118453 


28 


151. 78917Z 


29 


3. 188124 


30 


7. 046192 


31 


5616370. 680664 
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APPENDIX D 



This appendix presents the results of the regression arKilysis which provided 
foVmulas for replacing rfrissing data. The regression analysis was computed from the 
correlation matrix which appears as Appendix C, 2 and which is based on districts 
with no missing data. Regression analysis requires a complete data matrix. 

The analysis yielded Hie normal beta coefficients presented in Appendix D. 1. 
Each row in the 12 by 19 array gives the normal beta coefficients for the 19 variables 
13 through 31 in predicting one of the twelve variables 1 through 12, The coefficients 

are least-squares best sets for the reduced sample of districts which had no missing data. 

* 

After each row appears the squared multiple correlation coefficient of the prediction-* 
that is, the proportion of the variance of the variable (1 through 12) which is pre- 
dictable from the 19 variables (13 through 31). 

The formulas based on the beta coefficients were used in computing replace- 
ment values for the districts with missing data. 

This appendix and its application are explained in Section 11. B* 



Appendix D. 1 . NORMAL BETA COEFFICIENTS AND SQUARED MULTIPLE CORRELATIONS 
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APPENDIX E 

This appendix presents the final correlational analysis on the input data. 

The analysis is based on the complete data matrix with regression estimates substituted 
for missing data. 

In Appendix E. 1 appear the means and standard deviations of the variables^ 
and in Appendix E. 2 appears the matrix of variable intercorrelations. 

The correlation matrbc, given in Appendix E. 2 , served as input to the 
factorization procedures. 

This appendix is forther discussed in Section II. B. 
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Appendix E. 1. MEANS AND STANDARD DEVIATIONS 



MEANS 



STANDARD DEVIATIONS 



1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 

29 

30 

31 



5.544972 
3. 135900 
5012.482243 
47-416935 
113.412368 
1-632026 
1.879863 
-0.791359 
12. 654994 
7.313257 
8-782369 
-1. 760688 
1. 797468 
1.444620 
1.746835 
487.558544 
797.996835 
28- 827532 
6. 174051 
17.740506 
2. 15 50 63 
3.465190 
0.670886 
0.460443 
3.213608 
33727812.816406 
32552-498190 
199.202534 
20.849323 
9.517363 
5380376.963135 



1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 
21 
22 

23 

24 

25 

26 

27 

28 

29 

30 

31 



1. 741822 
0. 705652 
839- 0140 15 
30. 421660 
53. 507813 
2.287443 
0. 735184 
0. 795903 
1.414085 
1. 409000 
1-089274 
1. 685647 
0.401886 
0- 496924 
0. 954249 
2060. 797673 
3224. 220010 
101. 939330 
36. 860129 
55. 195788 
3. 214092 
20. 754031 
1. 660002 
0. 932035 
7. 207828 
167630550. 390625 
28658. 983108 
158. 497818 
4. 198261 
7. 386481 
5501990. 301147 



Appendix E. 2 . CORRELATIONS 
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APPENDIX F 



This appendix presents the outputs from a factorization of the correlation 
> 

matrix, R, given In Appendix E. 7. The factorization scheme is that of principai 
components . 



in Appendix F. 1 are given the complete set of latent roots of the correlation 

2 



matrix; they are considered to be the diagonal entries of the diagonal mairix M 
The corresponding latent vectors of R are given as the columns of Appendix F. 2; 
this matrix is denoted Q. The factor matrix for the principal components is given in 
Appendix F. 3. It was computed according to the formula: F^ = QM. In the appendix 
the factor matrix is bordered by row and column sums of squares. The row sums of squares 
are uniformly 1 .0 because all the variance of each variable is accounted for in the 
factors. The column sums of squares are the factor variances, and are additive since 
the factors are uncorrelated. Note that the column sums of squares are equal to the 
corresponding latent roots, given in Appendix F . 1 , and are arranged in order of de- 
creasing magnitude. 



This appendix is further discussed in Section II. C. 
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Appendix F. 1. LATENT ROOTS OF R; 
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10 
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11 
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29 
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Appendix F. 3. UNROTATED PRINCIPAL COMPONENTS FACTOR MATRIX: F^ =QM 
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APPENDIX G 



The principal component factorization of the correlation matrix R was given in 



Appendix F. The factor matrix presented there provided the basis for deriving the 



noimal varimax orthogoral factorization of R. 



The factor matrix F^. given in Appendix F. 3 was subjected to the nonnal vari- 
max orthogonal rotation procedure, and the matrix was derived, where T^ is an 
orthonormal matrix and is presented in Appendix G. 1 . The matrix F^T^ is the rotated 
component factor matrix, and is presented as Appendix G. 2. The rows and columns 
are bordered by row and column sums of squares. The row sums of squares are uniformly 
1 .0, because the factors account for all the variance in each variable. The columi. 
sums of squares are the factor variances and are additive since the factors are uncor- 
related. Note that the factors have been arranged in order of decreasing variance. 



Because the factor scores for the rotated component factors were to be computed, the 

-1 



rotated component factor weight matrix, QM T, was computed. This matrix is given 
in Appendix G. 3. The columns have been normalized and give the normal weights for 
computing the factor scores from the original variables. 



This appendix is further discussed in Section II. C. 
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Appendix G. 1. NORMAL VARIMAX TRANSFORMATION MATRIX: 
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Appendix G. 3. ROTATED PRINCI PAL COMPONENTS FACTOR WEI GHT MATRIX: QM" 
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APPENDIX H 



An image analysis was perfonmed on the correlation matrix R given in Appendix 
E.*v2. In image analysi^'.each original variable is conceptually partitioned into two 
parts: the image variable, which is the original variable as predicted in linear regression 
by the other original variables; and the anti-image variable, which is the regression 
residual. The analysis then proceeds on the bash of the image variables. 

In Appendix H. 1 are presented the variances of Hie anti-image variables. These 

2 

are considered to be the diagonal entries in the diagonal matrix S , end each is the 

proportion of the variance of an original variable which is not predictable from the other 

’ ^ ^ -12' 
original variables. In Appendix H. 2 is given the matrix transformation, I - R S , 

f ■ ' ;* 

from original to image variables. Each column corresponds to an image variable and gives 
the linear equation for computing the image variable from t/ie original variables. The 

V ' 

columns In the oppendix have been normalized. In Appendix H. 3 is given the correlation ' 
matrix, SR ^ S, of the anti-image variables. 

- . . , / 

The anti-image variables are approximations to the unique factors in pure 

■ ■ i'i 

factor arxi lysis, and so they are expected to be essentially uncorrelated. 

: , ■ ' •■'■I 

This appendix is further explained in Section II • C. . 
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Appendix H. 1. VARIANCES OF ANTI-IMAGE VARIABLES: S 
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Appendix H. 2, ORIGINAL TO lAAAGE VARIABLE TRANSFORMATION MATRIX: 
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Appendix H. 3. CORRELATIONS OF ANTI-IMAGE VARIABLES: SR 
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APPENDIX I 



The image covariance matrix, G, was facfored according to fhe Harris 

2 

factofizaiion scheme. The fnatrix G is not given here, but it equals R-2S^ + 



The Harris factorization begins with the determination of tbe Harris roots of 

R, which equal the latent roots of S ^ RS \ They are given as Appendix 1 . 1, and 

2 

are considered to be the diagonal entries of the diagonal matrix , The corresponding 

“1 -1 2 2 2 9 

Harris roots of G, which equal the latent roots of S GS . are B = (B -1) B 

g ' r ' r 

and are also given in the Appendix 1.1. The Harris vectors of R and G, which equal 
the latent vectors of S RS and S ^ GS \ form the columns of the matrix X, which 
IS presented in Appendix I. 2. ' The Harris factors of G are given in Appendix I. 3, 
which is computed as F = SXB and is called the unrotated image factor matrix. In the 

g g 

appendix the rows and column are bordered by row and column sums of squares. The 
row sums of squares are the image variable variances, and the column sums of squares 
are the image variances of the factors. The order of the factors follows the order of the 
Harris roots. 



The appendix is further discussed in Section II. C. 
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Appendix 1.1. HARRIS ROOTS OF R AND G: and 
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The unrotated image factor matrix wos given in Appendix 1.3. It pro- 
viejed the basis for findii^ the normal varimax orthogonal factorization of the image 
covariance matrix G. 



The normal varimax orthogonal rotation procedure was applied to F , given 

9 

in Appendix I. 3, and the matrix FgTg was derived, where T^ is an orthonormal matrix, 
and is presented in Appendix J. 1. The matrix F T is the rotated image factor matrix, 
and is presented in Appendix J. 2. In the appendix, the rows and columns are bordered 
by row and column sums of squares. The row sums of squares are the variances of the 
image variables. The column sums of squares are the rotated image factor variances. 

Note that the factors have been arranged in order of decreasing image variance. The 
requirement for securing factor scores necessitated the computation of the rotated image 

factor weight matrix. It was computed as S"' XB"' T . In Appendix J. 3, each column 

y 

gives the normalized weights for the original variables in computing the factor score. 

This appendix is discussed further in Section II. C, 
















Appendix J. 1 . NORAAAL VARIMAX TRANSFORMATION AAATRIX 
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Appendix J. 3. ROTATED IMAGE FACTOR WEIGHTS: 
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APPENDIX K 



In order to facilitate comparison of the rotated component and the rotated 
image factor structures, the matrix of crosscorrelations beh^een the rotated image and 
the rotated component factor scores was computed. 



in Appendix K. 1 is given the matrix of crosscorrelations. The columns 

correspond to the rotated image factors and the rows correspond to the rotated component 

factors. The crosscorreiation matrix was computed as T 'M Q* RS XB T . 

f •‘ 9 

t 

This appendix is discussed further in Section il. C. 



Appendix K. 1. CORRELATIONS BETWEEN PRINCIPAL COMPONENTS FACTORS AND 
IMAGE FACTORS 
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APPENDIX L 



This appendix allows comparisons across strata of different values for 
selected choracteristics. The list of characteristics and the associated statewide 
mepn. ore given on the fold-out tab . The strata were fomied by first sepotating each 
of five rotated image factor score distributions at the median, and then identifying 
oil the districts which were members of each of the 32 unique combinations of above- 
the-median (+) and below-the-medion (-) for the set of five rotated image factor scores. 
The procedure far fanning strata is more fully described in Section II. D. 



The appendix is divided into four pages, and eight strata are described on 

each page. Each column gives the selected values for one of the strata. The pattern 

of (+) and (-) at the head of the column identifies the stratum. The next four numbers 
give the size of that stratum and the portion of the disti icis therein which fall into 



certain WSDPI classifications. The rest of the entries in the column are stratum averages, 
taken over all rtie districts in the stratum, for the selected characteristics. 





This appendix is discussed and interpreted in Section III. B. 
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Summary Measures for Stratum Chafacteri2Xition 
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Factor One - Numerical Size 

Factor Two -Organizational Complexity 

Factor Three - Teacher Experience 

Factor Four - School Unit Size 

Factor Five - Economic Power 

Number of Districts 

Percent with county-based administration 
Percent which have high schools 
Percent which receive integrated aid 



Total Enrollment Per District 

Elementary enrollment per district 
Secondary enrollment per district 



lotui Staff Per District 

Elementary teachers per district 
Secondary teachers per district 
Other professionals per district 



Number of Schools Per District 

Schools per district with only one or two rooms 
Schools per district with three or more rooms 



Equalized Valuation Per District 



Students per school in the stratum 
Students per staff in the stratum 
Staff per school in the stratum ^ 
Valuation per student in the stratum 
Valuation per school in the stratum* 



STATEWIDE 

CHARACTERISTICS 

632 

79.7 

55.5 

61.3 



1285.6 

798.0 

487.6 



58.4 

28.8 

23.9 

5.7 



4.3 

1 . 1 

3.0 

33,727.8 



295 ,9 
22.0 
13.4 
26.2 
7,762.6 



(Dollars x 1000) 
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APPENDIX M 



This appendix contains Hie full descripHons of the input- variables. 

The variables are named and numbered fo correspond with the list of input variables 
given in Table 7. 

The definitions and sources of the data represented by the variables are 
detailed in Section II. A. 
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Names and DescripHons of Initiai Input* Variables 
Variable 1; Mean Credential 

Elementary school teachers hold different kinds of teaching credentials. Ten kinds 
of credentials are recognized in Wisconsin, and for this study they were given preference 
ratios according to WSDPI criteria: the highest ratings corresponded to the highest numeric 
codes. These are the kinds of credentials and their numeric codes: 



Code Kind of Teaching Credential 

0 1 year special license 

1 1 year permit 

2 2 year license 

3 3 year license 

4 5 year term certificate 

5 4 year term certificate 

6 3 year term certificate 

7 2 year term certificate 

8 1 year license 

9 life certificate 

For a district. Variable 1, Mean Credential, is the arithmetic mean of the ratings of all the 
full-time elementary teachers in that district. 



Variable 2: Mean Degree 

Elementary school teachers differ with respect to the academic degrees which they 
have earned. The WSDPI records the highest academic degree held by a teacher, and for 
this study the degrees were given preference ratings according to WSDPI criteria: the 
highest ratings corresponded to the highest numeric codes. These are the degrees, together 
with their numeric codes: 
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Code 

1 

2 

3 

4 

5 

6 
7 
0 



Kind of Academic Degree 

less than 2 years (no diploma) 

2 years (diploma) 

3 years 
Bachelor's 
Master's 

6 years 

Doctor's 

other 



Fora district. Variable 2 , Mean Degree, is the arithmetic mean of the numeric codes of 
all full-time elementary teachers in that district.. 



Variable 3; Mean Salary 

TUg WSDP! records the salary, in doiiars per schooi year, of each teacher in Wisconsin 

Variable 3 for a district is the arithmetic mean of the salaries of all the full-time elementary 
teachers in that district. 



Variable 4; Mean Local Experience 

The WSDPI records the teoching experience, in months, of each teacher in Wisconsin 
The record is given for both local experience and total experience. Local experience is given 
by the total number of months a teacher has been teaching in the district where he is currently 
employed. For a district. Variable 4 is the arithmetic mean of the total months of locol 
teaching experience of all the full-time elementary teachers in that district. 



Variable 5; Mean Total Experience 

This variable for a district is the arithmetic mean of the total number of months of 
teaching experience, both local and elsewhere, or all the full-time elementary teachers 
in that district. 



Variable 6; Mean, Grade Spread 

A teacher might be responsible for a classroom group which includes students from 
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several grades/ for Instance/ some of his students may be In grade 2/ some In grade 3/ 
and some In grade 4. Another teacher might have only fifth grade students In a self- 
contained classroom. Variable 6 reflects the spread or range of grades of the students 
for which a teacher Is responsible. A teacher whose students are all at the some grade 
level will be assigned a score of "1". If the students are from two grade levels, a score 
of "2" will be assigned. If a teacher has students In Grades 2 through 4 , the Grade 
Spread Score Is '*3". For a district, then, this variable Is the arithmetic mean of the 
gra^e spread scores for a|l the full-time elementary teachers In that district. 



Variable 7; Log-Variance Credential 



This variable for a district Is the logarithm, base e , of Jbe variance of the credential 
codes (see Variable 1) of all the full-time elementary teachers In a district. 



Variable 8; Log-Variance Degree 

This variable for a district Is the logarithm, base e , of the variance of the degree 
code scores (see Variable 2) of all the full-time elementary teachers In that district. 



Variable 9; Log-Variance Salary 

This variable for a district Is the logarithm, base e , of tbe variance of the salaries 
In dollars (see Variable 3) of all the full-time elementary teachers in that district. 



Variable 10; Log-Variance Local Experience 

This variable for a district Is the logarithm, base e , of the variance of the years 

of local teaching experience (see Variable 4) of all the full-time elementary teachers In 
that district. 

Variable 11; Log -Variance Total Experience 

This variable for a district Is the logarithm, base e , of the variance of the total 

years of teaching experience (see Variable 5) of all the full-time elementary teachers In 
that district. 
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Variable 12t Log-Variance Grade Spread 



This variable for a disfricf is fhe logarithm, base e , of the variance of the spread 
in grades taught (see Variable 6) by all the full-time elementary teachers in that district* 



Variable 13: Kind 



^ This variable indicates the general kind of administrative structure un^er which a 
school district operates. Yhe WSDPI uses a seven -category coding scheme (See Table 1, 
Part II). Each category indicates a slightly different tax base. For research purposes 
the WSDPI scheme was reduced to a two-category scheme, wherein a "1" was assigned to 
city-based districts, and a "2‘* to county-based districts. These codes are inverted with 
respect to preference ratings assigned by the WSDPI (see Table 1). The lower value corres* 
ponds to the higher WSDPI rating. 



Variable 14; Scope 



This variable designates .whether or not a district has a high school . The variable 
is "1" for districts with one or more high schools and "2" for districts with no high schools. 
Note that these codes are inverted with respect to preference ratings assigned by the WSDPI 
(see Table 1). The lower value corresponds to the higher WSDPI rating. 



Variable 15: Class 



This variable indicates the level of state aid a district receives. The WSDPI dis- 
tributes state aid according to three classifications, each of which has a specific set of 
criteria. The three classes of state financial aids are: "Integrated", the class of districts 
which receives the highest rate of aid distribution; "Basic with Integrated", the class which 
receives the second highest rate of aid distribution; and "Basic", the class receiving the 
lowest rate of aid. For research purposes, numeric codes were assigned to the three classes 
as follows: "1" for Integrated, "2" for Basic with Integrated, and "3" for Basic. 



Variable 16; Secondary Enrollment 



> This variable is the total student enrollment of all the secondary schools in the dis- 
trict. It is coded "0" if there is no secondary school in the district. 
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Variable 17; Elementary Enrollment’ 

This variable is the total student enrollment of all the elementary schools in the 
district. Since only districts with some elementary students were included in this study, 
the variable is always greater than zero. 

Variable 18; Number of Elementary Teachers 

This vcriable fora district is its total number of all full-time elementary school 
teachers. 

Variable 19; Number of Junior High School Teachers 

This variable for a district is its total number of all junior high school teachers. 
Included are teachers who teach some elementary students, as well as some junior high. 

Variable 20; Number of High School Teachers 

f 

This variable is the total number of all high school teachers in a district. Also in- 
cluded are teachers who teach some elementary or junior high grades, as well as some high 
school . 

Variable 21 ; Number of Other Teachers 



This variable for a district is the total number of all teachers in the district who 
have not been counted in Variables 18, 19, or 20. Included are part-time teachers, and 
administrators who teach. 

Variable 22; Number of Other Professiona ls 

This variable for a district is the total number of all professional staff in the district 
who were not counted in variables 18, 19, 20, or 21 . Included are those professional 
employees with no teaching duties. Note that the sum of Variables 18 to 22 is the total 
number of professional employees in a district. 
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Variable 23; Number of One-Room Schools 

This variable Is the total number of schools In a district v/Ith only one teacher, 
and Is, by Inference, the number of one-room schools In that district. 

Variable 24t Number of Two" oom Schoo ls 

This variable Is the total number of schools In a district with exactly two teachers, 
and Is, by Inference, the^number of two-room schools in that district. 

Variable 25: Number of Three-or-More-Room Schools 

This variable for a district Is the total number of Its schools with three or more 
teachers, and Is, by Inference, the number of three-or-more room schools. Note that ifie 
sum of Variables 23 to 24 Is the total number of schools In a district. 

Variable 26; Equalized Valuation 

This variable Is the equalized valuation In dollars of a district. It Is the standardized 
property wealth of a district, as determined by the Wisconsin State Department of Taxation. 

The variable affects the distribution of state financial aid to the school district. 

Variable 27: Valuation/Student 

This variable Is the ratio of the equalized dollar valuation of a district to the number 
of students In that district. Note that this variable Is equal to the ratio of Variable 26 to 
the sum of Variables 16 and 17. 

Variable 28: Students/School 

This variable Is the ratio of the number of students In a district to the number of 
schools In that district. Note that this variable Is the ratio of sum of Variables 16 and 17 
to the sum of Variables 23, 24, and 25. 
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Variable 29; Students/Staff 

This variable is the ratio of the number of students in a district to the number of 
professional employees in that district. Note that Variable 29 is the ratio of the sum 
of Variables 16 and 17 to the sum of Variables 18 to 22. 

Variable 30; Staff/School 

This variable is the ratio of the number of professional employees in a district to 
the number of schools in that district. Note that Variable 30 is the ratio of the sum of 
Variables 18 to 22 to the sum of Variables 23 to 25. 

Variable 31: Valuation/School 

This variable is the ratio of the equalized dollar valuation of a district to tbe number 
of schools in that district. Note that this variable is the ratio of Variable 26 to the sum of 
Variables 23 to 25. 
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